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Description 

Technical Field 

5 [0001 ] The present invention relates to new isolates of the viral class Hepatitis C. polypeptides, polynucleotides and 
antibodies derived therefrom, as well as the use of such polypeptides, polynucleotides and antibodies in assays (e.g., 
immunoassays, nucleic acid hybridization assays, etc) and in the production of viral polypeptides. 

Background 

10 

[0002] Non-A, Non-B hepatitis (NANBH) is a transmisstole disease or family of diseases that are believed to be viral- 
induced, and that are distinguishable from other forms of viral-associated liver diseases, including that caused by the 
known hepatitis viruses, i.e., hepatitis A virus (HAV). hepatitis B virus (HBV), and delta hepatitis virus (HDV), as well as 
the hepatitis induced by cytomegalovirus (CMV) or Epstein-Ban virus (EBV). NANBH was first identified in transfused 

is individuals. Transmission from man to chimpanzee and serial passage in chimpanzees provided evidence that NANBH 
is due to a transmissible infectious agent or agents. Epidemiologic evidence is suggestive that there may be three types 
of NANBH: the water-borne epidemic type; the blood or needle associated type; and the sporadically occurring (com- 
munity acquired) type. However, until recently, no transmissible agent responsUe for NANBH had not been identified. 
[0003] Clinical diagnosis and identification of NANBH has been accomplished primarily by exclusion of other viral 

20 markers. Among the methods used to detect putative NANBH antigens and antfoodies are agar-gel diffusion, counter- 
immunoelectrophoresis, immunofluorescence microscopy, immune electron microscopy, radioimmunoassay, and 
enzyme-linked irnmunosorbent assay. However, none of these assays has proved to be sufficiently sensitive, specific, 
and reproducible to be used as a diagnostic test for NANBH. 

[0004] Until recently there has been neither darity nor agreement as to the identity or specificity of the antigen anti- 
25 body systems associated with agents of NANBH. It is possHe that NANBH is caused by more than one infectious agent 
and unclear what the serological assays detect in the serum of patients with NANBH. 

[0005] In the past, a number of candidate NANBH agents were postulated. See. ag.. Prince (1983) Ann. Rev. Micro- 
biol. aZ:217; Feinstone & Hoofnagle (1984) New Eng. J. Med. 311:185; Overby (1985) Curr. Heptol. 5:49; Overby 
(1986) Curr. Heptol. fi:65; Overby (1 987) Curr. Heptol. 2:35; and Iwarson (1987) British Med. J. 295:346. However, there 

30 is no proof that any of these candidates represent the etiological agent of NANBH. 

[0006] In 1987, Houghton et al. cloned the first virus definitively linked to NANBH. See. e.g., EPO Pub. No. 318,216; 
Houghton et al.. Science 244359 (1969). Houghton et al. described therein the cloning of an isolate from a new viral 
class, hepatitis C virus (HCV). the prototype isolate described therein being named "HCVr. HCV is a Ravi-like virus, 
with an RNA genome. Houghton et aJ. described the production of recombinant proteins from HCV sequences that are 

35 useful as diagnostic reagents, as well as polynucleotides useful in diagnostic hybridization assays and in the cloning of 
additional HCV isolates. 

[0007] The demand for sensitive, specific methods for screening and identifying earners of NANBH and NANBH con- 
taminated blood or blood products is significant Post-transfusion hepatitis (PTH) occurs in approximately 10% of trans- 
fused patients, and NANBH accounts for up to 90% of these cases. There is a frequent progression to chronic liver 
40 damage (25-55%). 

[0008] Patient care as well as the prevention of transmission of NANBH by blood and blood products or by dose per- 
sonal contact require reliable diagnostic and prognostic tods to detect nucleic acids, antigens and antfoodes related to 
NANBH. In addition, there is also a need for effective vaccines and immunotherapeutic therapeutic agents for the pre- 
vention and/a treatment of the disease 
45 [0009] While at least one HCV isolate has been identified which is useful in meeting the above needs, additional iso- 
lates, particularly those with divergent a genome, may prove to have unique applications. 

Summary of the Invention 

so [0010] New isolates of HCV has been characterized from Japanese blood donors who have been implicated as 
NANBH carriers. These isolates exhibit nucleotide and amino acid sequence heterogeneity with respect to the proto- 
type isolate, HCV1 , in several viral domains. It is believed that these distinct sequences are of in importance, particu- 
larly in diagnostic assays and in vaccine development 

[001 1 ] In one embodiment, the present invention provides a polynucleotide in substantially isolated form comprising 
55 a nucleotide sequence of at least 8 nucleotides from a J-1 HCV isolate, said J-1 HCV isolate having at least 90% nucle- 
otide sequence homology with the J-1 sequence of any one of Figures 7 to 10 or 13 to 18, wherein said nucleotide 
sequence of at least 8 nucleotides is distinct from the nucleotide sequence of HCV isolate HCV-1. 
[0012] In another embodiment of the present invention provides a purified polypeptide comprising a amino acid 
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sequence which: 

(a) is encoded by a nucleotide sequence of the invention, said coding being in frame with the corresponcfing amino 
acid sequences set out in Figures 7 to 10 or 13 to 18; 
5 (b) comprises an antigenic determinant; and 

(c) contains at least one amino acid substitution compared to the amino acids encoded by the corresponding por- 
tion of the HCV-1 sequence. 

[001 3] Yet another embodiment of the present invention provides a polypeptide of the invention immobilized on the 
10 solid support 

[001 4] In a further embodiment of the present invention, an immunoassay tor detecting the presence of anti-HCV anti- 
bodies in a test sample is provided comprising: (a) incubating the test sample under conditions that allow the formation 
of antigen-antibody complaxes with polypeptide of the invention, wherein the polypeptide is not immunologically cross- 
reactive with HCV-1 and (b) detecting any antigen-antibody complexes formed. 

is [0015] Yet another embodiment of the present invention prcvkjes a niethod erf ^ HCV polynucleotides in a test 
sample comprising: (a) providing a polypeptide of the invention as a probe: (b) contacting the test sample and the probe 
under conditions that allow for the formation of a polynucleotide duplex between the probe and its complement in the 
absence of substantial polynucleotide duplex formation between the probe and non-HCV polynucleotide sequences 
present in the test sample; and (c) detecting any polynucleotide duplexes comprising the probe. 

20 [0016] These and other embodiments of the present invention will be readily apparent to this of ordinary skill in the 
art in view of the following description. 

Briftf rifffifirintinn nf thfl Figurm 

25 [0017] 

Figure 1 shows the consensus sequence of the coding strand of a fragment from the J7C/E domain with the het- 
erogeneities. 

Figure 2 shows the consensus sequence of the coding strand of a fragment from the J1 E domain with the hetero- 
30 generties. 

Figure 3 shows the consensus sequence of the coding strand of a fragment of the J1 E/NS1 domain with the het- 
erogeneities. 

Figure 4 shows the consensus sequence of the coding strand of a fragment from the J1 NS3 domain with the het- 
erogeneities. 

35 Figure 5 shews the consensus sequence of the coding strand of a fragment from the J 1 NS5 domain with the het- 
erogeneities. 

Figure 6 shows the homology of the J7 C/E consensus sequence with the nucleotide sequence of the same domain 
fromHCVL 

Figure 7 shews the homology of the J 1 E consensus sequence with the nucleotide sequence of the same domain 
40 fromHCVL 

Figure 8 shows the homology of the J1 E/NS1 consensus sequence with the nucleotide sequence of the same 
domain from HCV1 . 

Figure 9 shows the homology of the J1 NS3 consensus sequence with the nucleotide sequence of the same 
domain from HCV1. 

45 Figure 10 shows the homology of the J1 NS5 consensus sequence with the nucleotide sequence of the same 
domain from HCV1. 

Figure 1 1 shows the putative genomic organization of the HCV1 genome. 

Figure 1 2 shows the nucleotide sequence of the ORF of HCV1 . In the figure nucleotide number 1 is the first A of 
the putative initiating methionine of the large ORF; nucleotides upstream of this nucleotide are numbered with neg- 
50 atrve numbers. 

Figure 13 shows the consensus sequence of the coding strand of a fragment from the J1 NS1 domain (J1 1519) 
with the nucleotide sequence of the same domain from HCV1 . Also shown are the amino acids encoded therein. 
Figure 1 4 shows a composite of the consensus sequence from the core to the NS1 domain of J 1 with the nucleotide 
sequence of the same domain from HCV1 . Also shown are the amino acids encoded therein. 
ss Figure 15 shows a consensus sequence of the coding strand of the NS1 domain of J1. as determined in Example 
IV. Also shown are the nucleotide sequence of the same domain from HCV1 . and the amino acids encoded in the 
HCV1 and J1 sequences. 

Figure 16 shows a consensus sequence of a coding strand of the C200 region of the NS3-NS4 domain ofJ1. Also 
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shewn are the nucleotide sequence of the same domain from HCV1 . Also shown are the amino acids encoded in 
the sequences. 

Figure 1 7 shows a consensus sequence of the coding strand of the NS1 domain of J 1, as determined in Example 
V. Also shown are the nucleotide sequence of the same domain from HCV1 , and the amino acids encoded in the 
5 sequences. 

Figure 18 shews a consensus sequence of the coding strand of the untranslated and core domains of J1. Also 
shewn are the nucleotide sequence of the same domain from HCV1, and the amino acids encoded in the 
sequences. 

10 Detailed Description of the Invention 

[0018] The practice of the present invention will employ, unless otherwise indicated, conventional techniques of 
molecular biology. microbiology, recombinant DNA techniques, and immunology, which are within the skill of the art 
Such techniques are explained fully in the literature. Sfifi e.g., Maniatis, Frtsch & Sambrook. MOLECULAR CLONING; 

15 A LABORATORY MANUAL (1982); DNA CLONING, VOLUMES I AND II (D.N Glover ed. 1985); OLIGONUCLEOTIDE 
SYNTHESIS (M.J. Gait ed. 1984); NUCLEIC ACID HYBRIDIZATION (B.D. Names & S.J. Higgins eds. 1984); TRAN- 
SCRIPTION AND TRANSLATION (aD. Hames & S.J. Higgins eds. 1984); ANIMAL CELL CULTURE (R.I. Freshney ed. 
1986); IMMOBILIZED CELLS AND ENZYMES (IRL Press, 1986); a Perbal, A PRACTICAL GUIDE TO MOLECULAR 
CLONING (1984); the series, METHODS IN ENZYMOLOGY (Academic Press. Inc.); GENE TRANSFER VECTORS 

20 FOR MAMMALIAN CELLS (J.H. Miller and M.R Calos eds. 1987. Cold Spring Harbor Laboratory), Methods in Enzy- 
mology Vol. 154 and Vol. 155 (Wu and Grossman, and Wu. eds.. respectively), Mayer and Walker, eds. (1987). IMMU- 
NOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY (Academic Press. London). Scopes, (1987), 
PROTEIN PURIFICATION: PRINCIPLES AND PRACTICE. Second Edition (Springer-Vertag, N.Y), and HANDBOOK 
OF EXPERIMENTAL IMMUNOLOGY, VOLUMES HV (D.M. Weir and C. C. Biackwell eds 1986). AO patents, patent 

25 applications, and other publications mentioned herein, both supra and infra, are hereby incorporated herein by refer- 
ence. 

[0019] The term "hepatitis C virus" has been reserved by workers in the field for an heretofore unknown otologic 
agent of NANBH. Accordingly, as used herein, "hepatitis C virus" (HCV) refers to an agent causative of NANBH, which 
was formerly referred to as NANBV and/or BB-NANBV from the class of the prototype isolate, HCV1, described by 

30 Houghton et al. See, ag., EPO Pub. No. 318.21 6 and US. Patent App. Serial No 355,002, filed 19 May 1989 (available 
in non-US. applications claiming priority therefrom), the disclosures of which are incorporated herein by referenca The 
nucleotide sequence and putative amino acid sequence of HCV1 is shown in Figure 6. The terms HCV, NANBV, and 
BB-NANBV are used interchangeably herein. As an extension of this terminology, the disease caused by HCV, formerly 
called NANB hepatitis (NANBH), is called hepatitis C. The terms NANBH and hepatitis C may be used interchangeabry 

35 herein. The term "HCV", as used herein, denotes a viral species of which pathogenic strains cause NANBH, as well as 
attenuated strains or defective interfering particles derived therefrom. 

[0020] HCV is a Ravi-like virus. The morphology and composition of Ravi virus particles are known, and are dis- 
cussed by Brinton (1986) THE VIRUSES: THE TOQAVIRIDAE AND FLAVIVIRIDAE (Series eds. Fraenkel-Conrat and 
Wagner, vol eds. Schlesinger and Schfesinger. Plenum Press), p.327-374. Generally, with respect to morphology, Ra- 
40 viviruses contain a central nudeocapsid surrounded by a lipid bi layer. Virions are spherical and have a diameter of 
about 40-50 nm. Their cores are about 25-30 nm in diameter. Along the outer surface of the virion envelope are projec- 
tions that are about 5-1 0 nm long with terminal knobs about 2 nm in diameter. 

[0021] The HCV genome is comprised of RNA. It is known that RNA containing viruses have relatively high rates of 
spontaneous mutation, i.a, reportedly on the order of 10 -3 to to* 4 per incorporated nucleotide. Therefore, there are mul- 

45 tiple strains, which may be virulent or avtrulent within the HCV dass or speciea 

[0022] It is believed that the genome of HCV isolates is comprised of a single ORF of approximately 9,000 nudeotides 
to approximately 1 2,000 nudeotides, encoding a polyprotein similar in size to that of HCV1 , an encoded poiyprotein of 
similar hydrophobic and antigenic character to that of HCV1 , and the presence of co-linear peptide sequences that are 
conserved with HCV1 . In addition, the genome is believed to be a positive-stranded RNA. 

so [0023] Isolates of HCV comprise epitopes that are immunologicaliy cross-reactive with epitopes in the HCV1 genome. 
At least some of these are epitopes unique to HCV when compared to other known Ravrviruses. The uniqueness of the 
epitope may be determined by its immunological reactivity with anti-HCV antibodies and lack of immunological reactivity 
with antibodies to other Ravivirus spedea Methods for determining immunological reactivity are known in the art for 
example, by radioirnrnunoassay, by ELISA assay, by hemagglutination, and several examples of suitable techniques for 

55 assays are provided herein. 

[0024] It is also expected that the overall homology of HCV isolates and HCV1 genomes at the nudeotide level prob- 
ably will be about 40% or greater, probably about 60% or greater, and even more probably about 80% to about 90% or 
greater. In addrtion that there are many corresponding contiguous sequences of at least about 13 nudeotides that are 
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fully homologous. The correspondence between the sequence from a new isolate and the HCV1 sequence can be 
determined by techniques known in the art For example, they can be determined by a direct comparison of the 
sequence information of the polynucleotide from the new isolate and HCV1 sequences. Alternatively, homology can be 
determined by hybridization of the polynucleotides under conditions which form stable duplexes between homologous 
5 regions (for example, those which would be used prior to Si digestion), followed by digestion with single-stranded spe- 
cific nudease($), followed by size determination of the digested fragments. 

[0025] Because of the evolutionary relationship of the strains or isolates of HCV, putative HCV strains or isolates are 
identifiable by their homology at the polypeptide level. Thus, new HCV isolates are expected to be more than about 40% 
homologous, probably more than about 70% homologous, and even more probably more than about 80% homologous, 
io and possWy even more than about 90% homologous at the polypeptide level. The techniques for determining amino 
acid sequence homology are known in the art For example, the amino acid sequence may be determined directly and 
compared to the sequences provided herein. Alternatively the nucleotide sequence of the genomic material of the puta- 
tive HCV may be determined, the amino acid sequence encoded therein can be determined, and the corresponding 
regions compared. 

is [0026] The ORF of HCV1 is shown in Figure 12. The non-structural, core, and envelope domains of the polyprotein 
have been predicted for HCV1 (Figure 5). The "C". or core, polypeptide is believed to be encoded from the 5' terminus 
to about nucleotide 345 of HCV1 . The putative "E", or envelope, domain of HCV1 is believed to be encoded from about 
nucleotide 346 to about nucleotide 1050. Putative NS1, or non-structural one domain, is thought to be encoded from 
about nucleotide 1051 to about nucleotide 1953. For the remaining domains, putative NS2 is thought to be encoded 

20 from about nucleotide 1954 to about nucleotide 3018, putative NS3 from about nucleotide 3019 to about nucleotide 
4950, putative NS4 from about nucleotide 4951 to about nucleotide 6297, and putative NS5 from about nucleotide 6298 
to the 3' terminus respectively. The above boundaries are approximations based on an analysis of the ORF. The exact 
boundaries can be determined by those skilled in the art in view of the disclosure herein. 

[0027] "HCV/jr or Mr and "HCV/jr or "J7" refer to new HCV isolates characterized by the nucleotide sequence - 
25 disclosed herein, as well as related isolates that are substantially homologous thereto; La, at least about 90% or about 
95% at the nucleotide level. It is believed that the sequences disclosed herein characterize an HCV subclass that is pre- 
dominant in Japan and other Asian and/or Pacific rim countries. Additional J1 and J7 isolates can be obtained in view 
of the disclosure herein and EPO Pub. No. 318,21 6. In particular, the J1 and J7 nucleotide sequences disclosed herein, 
as well as the HCV1 sequences in Figure 12, can be used as primers or probes to clone additional domains of J 1. J7, 
so or additional isolates. 

[0028] As used herein, a nucleotide sequence "from* a designated sequence or source refers to a nucleotide 
sequence that is homologous (i.e., identical) to or complementary to the designated sequence or source, or a portion 
thereof. The J1 sequences provided herein are a minimum of 15 nucleotides preferably 20 nucleotides or longer. The 
maximum length is the complete viral genome. 

35 [0029] In some aspects of the invention, the sequence of the region from which the polynucleotide is derived is pref- 
erably homologous to or complementary to a sequence which is unique to an HCV genome or the J 1 genome. Whether 
or not a sequence is unique to a genome can be determined by techniques known to those of skill in the art For exam- 
ple, the sequence can be compared to sequences in databanks, e.g., Qenebank, to determine whether it is present in 
the uninfected host or other organisms. The sequence can also be compared to the known sequences of other viral 

40 agents, including those which are known to induce hepatitis, e.g., HAV, HBV, and HDV, and to other members of the Fla- 
viviridae. The correspondence or non-correspondence of the derived sequence to other sequences can also be deter- 
mined by hybridization under the appropriate stringency conditions. Hybridization techniques for determining the 
complementarity of nucleic acid sequences are known in the art See also, for example, Maniatis et a). (1982) MOLEC- 
ULAR CLONING; A LABORATORY MANUAL (Cold Spring Harbor Press, Cold Spring Harbor, N.Y). In addition, mis- 

45 matches of duplex polynucleotides formed by hybridization can be determined by known techniques, inducing for 
example, digestion with a nuclease such as S1 that specifically digests single-stranded areas in duplex polynucleotides. 
Regions from which typical DNA sequences may be derived include, but are not limited to. regions encoding specific 
epitopes, as well as non-transcribed and/or non-translated regions. 

[0030] The J1 polynucleotide is not necessarily physically derived from the nucleotide sequence shown, but may be 
so generated in any manner, including for example, chemical synthesis or DNA replication or reverse transcription or tran- 
scription. In addition, combinations of regions corresponding to that of the designated sequence may be modified in 
ways known in the art to be consistent with an intended use. The polynucleotides may also include one or more labels, 
which are known to those of still in the art 

[0031 ] An amino acid sequence "from" a designated polypeptide or source of polypeptides means that the amino acid 
55 sequence is homologous (i.e., identical) to the sequence of the designated polypeptide, or a portion thereof. An amino 
acid sequence "from" a designated nucleic acid sequence refers to a polypeptide having an amino acid sequence iden- 
tical to that of a polypeptide encoded in the sequence, or a portion thereof. The J1 amino acid sequences in the 
polypeptides of the present invention are at least 10 amino acids, more preferably at least about 15 amino acids, and 
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most preferably at least about 20 amino acids. 

[0032] The polypeptides of the present invention are not necessarily translated from a designated nucleic acid 
sequence; the polypeptides may be generated in any manner, inducting for example, chemical synthesis, or expression 
of a recombinant expression system, or isolation from virus. The polypeptides may include one or more analogs of 
amino acids or unnatural amino acids. Methods of inserting analogs of amino acids into a sequence are known in the 
art The polypeptides may also include one or more labels, which are Known to those of skill in the art 
[0033] The term "recombinant polynucleotide" as used herein intends a polynucleotide of genomic, cDNA, semisyn- 
thetic or synthetic origin which, by virtue of its origin or manipulation: (1) is linked to a polynucleotide other than that to 
which it is linked in nature, or (2) does not occur in nature. 

[0034] The term "polynucleotide" as used herein refers to a polymeric form of nucleotides of any length, either rtoo- 
nudeotides or deoocyribonudeotidea This term refers only to the primary structure of the molecuia Thus, this term 
includes double- and single-stranded DNA, and RNA. It also includes known types of modifications, for example, labels 
which are known in the art methyiation, "caps", substitution of one or more of the naturally occurring nucleotides with 
an analog, inter nucleotide modifications such as, for example, th06e with uncharged linkages (ag., methyl phospho- 
nates, p h o sp hotriesters, phosphoamidates, carbamates, etc.) and with charged linkages (e.g.. phosphorothioates, 
phosphorodithioates. eta), those containing pendant moieties, such as, for example proteins (including for e.g., nucle- 
ases, toxins, antfoodies. signal peptides, pory-L-tysine, etc.), those with intercalators (e.g., acridine, psoralen, etc.), 
those containing chelators (e.g., metals, radioactive metals, boron, oxidative metals, etc.), those containing atkytators, 
those with modified lintages (e.g.. alpha anomeric nucleic acids, etc.), as well as unmodified forms of the polynude- 
otida 

[0035] "Purified polynucleotide" refers to a corrposrtion comprising a specified polynucleotide that is substantially free 
of other components, such composition typically comprising at least about 70% of the specified polynucleotide, more 
typically at least about 80%. 90% or even 95% to 99% of the specified polynucleotide. 

[0036] "Purified polypeptide" refers to a composition corrpristng a specified polypeptide that is substantially free of 
other components, such composition typically corrprising at least about 70% of the specified polypeptide, more typi- 
cally at least about 80%, 90% or even 95% to 99% of the specified polypeptide 

[0037] "Recombinant host cells", "host cells", "cells", "cell lines", "ceO cultures", and other such terms denote micro- 
organisms or higher eukaryotic cell lines cultured as unicellular entities that can be, or have been, used as recipients 
for a recombinant vector or other transfer DNA. and include the progeny of the original cell which has been transformed. 
It is understood that the progeny of a single parental cell may not necessarily be completely identical in morphology or 
in genomic or total DNA corrplement as the original parent due to natural, accidental, or delfoerate mutation. 
[0038] A "replicon" is any genetic element e.g., a plasmid, a chromosome, a virus, a cosmid, eta that behaves as an 
autonomous unit of polynucleotide replication within a cell; i.a, capable of replication under its own control. 
[0039] A "cloning vector" is a replicon that can transform a selected host cell and in which another polynucleotide seg- 
ment is attached, so as to bring about the replication and/or expression of the attached segment Typically, cloning vec- 
tors include plasnids, virus (e.g., bacteriophage vector) and cosmids. 

[0040] An Integrating vector" is a vector that does not behave as a replicon in a selected host cell, but has the ability 
to integrate into a replicon (typically a chromosome) resident in the selected host to stably transform the host 
[0041] An "expression vector* is a construct that can transform a selected host cell and provides for expression of a 
heterologous coding sequence in the selected host Expression vectors can be either a cloning vector or an integrating 
vector. 

[0042] A "coding sequence" is a polynucleotide sequence which is transcribed into mRNA and/or translated into a 
polypeptide when placed under the control of appropriate regulatory sequences. The boundaries of the coding 
sequence are determined by a translation start codon at the 5Merminus and a translation stop codon at the 3Mer minus. 
A coding sequence can include, but is not limited to mRNA, cDNA. and recombinant polynucleotide sequences^ 
[0043] "Control sequence" refers to polynucleotide regulatory sequences which are necessary to effect the expres- 
sion of coding sequences to which they are ligated. The nature of such control sequences differs depending upon the 
host organism. In prokaryotes, control sequences generally include promoter, ribosomal binding site, and terminators. 
In eukaryotes generally control sequences include promoters, terminators and, in some instances, enhancers. The 
term "control sequences" is intended to include, at a minimum, all components the presence of which are necessary for 
expression, and may also include additional advantageous components. 

[0044] "Operabty linked" refers to a juxtaposition wherein the components so described are in a relationship permit- 
ting them to function in their intended manner. A control sequence "operably linked" to a coding sequence is ligated in 
such a way that expression of the coding sequence is achieved under conctfons compatible with the control 
sequences. 

[0045] An "open reading frame" or ORF is a region of a polynucleotide sequence which encodes a polypeptide; this 
region may represent a portion of a coding sequence or a total coding sequenca 

[0046] "Immunologically cross-reactive" refers to two or more epitopes or polypeptides that are bound by the same 
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antibody. Cross-reactivity can be determined by any of a number of immunoassay techniques, such as a competition 
assay. 

[0047] As used herein, the term "anttoody" refers to a polypeptide or group of polypeptides which comprise at least 
one epitope. An "antigen binding site" is formed from the folding of the variable domains of an anttoody molecule(s) to 

5 form three-dimensional binding sites with an internal surface shape and charge distribution complementary to the fea- 
tures of an epitope of an antigen, which allows specific binding to form an antibody-antigen complex. An antigen binding 
site may be formed from a heavy- and/or light-chain domain (VH and VU respectively), which form hypervariaUe loops 
which contribute to antigen binding. The term "antibody" includes, without limitation, chimeric antibodies, altered anti- 
bodies, univalent antibodies, Fab proteins, and single-domain antibodies. In many cases, the biding phenomena of anti- 

10 bodies to antigens is equivalent to other ligand/anti-ligand binding. 

[0048] As used herein, a "single domain antibody" (dAb) is an anttoody which is comprised of an HL domain, which 
binds specifically with a designated antigen. A dAb does not contain a VL domain, but may contain other antigen bind- 
ing ck>rnains known to exist to antibodies, for example, the kappa and lambda domains. Methods for preparing dAbs are 
known in the art. See, for example, Ward et al, Nature Ml: 544 (1 989). 

15 [0049] Anttoodies may also be comprised of VH and VL domains, as well as other known antigen binding domains. 
Examples of these types of anttoodies and methods for their preparation and known in the art (see. ag., US. Patent 
No. 4,816,467, which is incorporated herein by reference), and include the following. For example. Vertebrate embod- 
ies" refers to anttoodies which are tetramers or aggregates thereof, comprising Hght and heavy chains which are usually 
aggregated in a "Y" configuration and which may or may not have covaJent linkages between the chains. In vertebrate 

20 antibodies, the amino acid sequences of the chains are homologous with those sequences found in antibodies pro- 
duced in vertebrates, whether in situ or in vitro (for example, in hybridomas). Vertebrate antibodies include, for example, 
purified polyclonal antibodies and monoclonal antbodies, methods for the preparation of which are described infra 
[0050] "Hybrid anttoodies" are antibodies where chains are separately homologous with reference to mammalian anti- 
body chains and represent novel assemblies of them, so that two different antigens are preciprtabie by the tetramer or 

25 aggregate In hybrid anttoodies, one pair of heavy and light chains are homologous to those found in an antibody raised 
against a first antigen, while a second pair of chains are homologous to those found in an antibody raised against a sec- 
ond antibody. This results in the property of "divalence", i.a, the ability to bind two antigens simuttaneousJy. Such 
hybrids may also be formed using chimeric chains, as set forth below. 

[0051] "Chimeric antibodies" refers to anttoodies in which the heavy and/br light chains are fusion proteins. Typically, 
30 one portion of the amino acid sequences of the chain is homologous to corresponding sequences in an antibody 
derived from a particular species or a particular class, while the remaining segment of the chain is homologous to the 
sequences derived from another species and/or class. Usually, the variable region of both light and heavy chains mim- 
ics the variable regions or anttoodies derived from one species of vertebrates, while the constant portions are homolo- 
gous to the sequences in the antibodies derived from another species of vertebrates. However, the definition is not 
35 limited to this particular example Also included is any antibody in which either or both of the heavy or light chains are 
composed of combinations of sequences mimicking the sequences in antibodies of differ em sources, whether these 
sources be from drffering classes or different species of origin, and whether or not the fusion point is at the variable/con- 
stant boundary. Thus, it is possible to produce antibodies in which neither the constant nor the variable region mimic 
know antibody sequences. It then becomes posstole, for example, to construct anttoodies whose variable region has a 
40 higher specific affinity for a particular antigen, or whose constant region can elicit enhanced complement fixation, or to 
make other improvements in properties possessed by a particular constant region. 

[0052] Another example is "altered antibodies", which refers to anttoodies in which the naturally occurring amino acid 
sequence in a vertebrate antibody has been varies. Utilizing recombinant ONA techniques, antibodies can be rede- 
signed to obtain desired characteristics. The posstole variations are many, and range from the changing of one or more 

45 amino acids to the complete redesign of a region, tor example, the constant region. Changes in the constant region, in 
general, to attain desired cellular process characteristics, e.g., changes in complement fixation, interaction with mem- 
branes, and other effector functions. Changes in the variable region may be made to alter antigen binding characteris- 
tics. The antibody may also be engineered to aid the specific delivery of a molecule or substance to a specific cell or 
tissue site The desired alterations may be made by known techniques in molecular biology, ag., recombinant tech- 

so niques, site-directed mutagenesis, etc. 

[0053] Yet another example are "univalent antibodies", which are aggregates comprised of a heavy-chain/light-chain 
dimer bound to the Fc (i.e., stem) region of a second heavy chain. This type of antibody escapes antigenic modulation. 
See, e.g., GJermie et al. Nature 295 : 712 (1982). Included also within the definition of anttoodies are "Fab" fragments of 
antibodies. The "Fab" region refers to those portions of the heavy and light chains which are roughly equivalent, or anal- 

55 ogous, to the sequences which comprise the branch portion of the heavy and light chains, and which have been shown 
to exhibit immunological binding to a specified antigen, but which lack the effector Fc portion. "Fab" includes aggregates 
of one heavy and one light chain (commonly known as Fab 1 ), as well as tetramers containing the 2H and 2L chains 
(referred to as F(ab)2). which are capable of selectively reacting with a designated antigen or antigen family. Fab anti- 
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bodies may be divided into subsets analogous to those described above, La. Vertebrate Fab", "hybrid Fab", "chimeric 
Fab", and "altered Fab". Methods of producing Fab fragments of antibodies are known within the art and include, for 
example, proteolysis, and synthesis by recombinant techniques. 

[0054] "Epitope" refers to an antibody binding site usually defined by a polypeptide, but also by non-amino acid hap- 
5 tens. An epitope could comprise 3 amino acids in a spatial conformation which is unique to the epitope, generally an 
epitope consists of at least 5 such amino acids, and more usually, consists of at least 8-10 such amino acids. 
[0055] "Antigen-antibody complex" refers to the complex formed by an antibody that is specifically bound to an epitope 
on an antigen. 

[0056] "Immunogenic polypeptide" refers to a polypeptide that elicits a cellular and/or humoral irrmune response in a 
10 mammal, whether alone or linked to a carrier, in the presence or absence of an adjuvant 

[0057] "Polypeptide" refers to a polymer of amino acids and does not refer to a specific length of the molecula Thus, 
peptides, oligopeptides, and proteins are included within the definition of polypeptide This term also does not refer to 
or exclude post-expression modifications of the polypeptide, for example, gfycosylations, acetylations, phosphoryla- 
tions and the like. Included within the definition are. for exarrple. polypeptides containing one or more analogs of an 
is amino acid (including, for example, unnatural amino acids, etc.). polypeptides with substituted linkages, as well as other 
modifications known in the art both naturally occurring and non-naturally occurring. 

[0058] "Transformation", as used herein, refers to the insertion of an exogenous polynucleotide into a host cell, irre- 
spective of the method used for the insertion, for exarrple, direct uptake, transduction, f-mating or electroporabon. The 
exogenous polynucleotide may be maintained as a non-integrated vector, for example, a plasmid. or alternatively, may 
20 be integrated into the host genome. 

[0059] A "transformed" host cell refers to both the immediate cell that has undergone transformation and its progeny 

that maintain the originally exogenous polynucleotide. 

[0060] Treatment" as used herein refers to prophylaxis andfor therapy. 

[0061] "Individual", refers to vertebrates, particularly members of the mammalian species, and includes but is not lim- 

2S rted to domestic animals, sports animals, and primates, including humans. 

[0062] "Sense strand" refers to the strand of a double-stranded DNA molecule that is homologous to a mRNA tran- 
script thereof. The "anti-sense strand" contains a sequence which is complementary to that of the "sense strand". 
[0063] "Antfoody-containing body component" refers to a component of an individual's body which is a source of the 
antibodies of interest Antibody-containing body components are known in the art and include but are not limited to, 

30 whole blood and components thereof, plasma, serum, spinal fluid, lymph fluid, the external sections of the respiratory, 
intestinal, and genitourinary tracts, tears, saliva, milk, white blood cells, and myelomas. 

[0064] "Purified HCV" isolate refers to a preparation of HCV particles which has been isolated from the cellular con- 
stituents with which the virus is normally associated, and from other types of viruses which may be present in the 
infected tissue. The techniques for isolating viruses are known to those of skill in the art and include, for example, cen- 
35 trifugation and affinity chro ma togr ap hy. 

[0065] An HCV "particle" is an entire virion, as well as particles which are intermediates in virion formation. HCV par- 
ticles generally have one or more HCV proteins associated with the HCV nucleic acid. 

[0066] "Probe" refers to a polynucleotide which forms a hybrid structure with a sequence in a target polynucleotide, 
due to a complementarity of at least one region in the probe with a region in the target 

40 [0067] "Biological sample" refers to a sample of tissue or fluid isolated from an individual, including but not limited to, 
for example, whole blood and components thereof, plasma, serum, spinal fluid, the external sections of the skin, respi- 
ratory, intestinal, and genitourinary tracts, tears saliva, milk, blood cells, tumors, organs, and also samples of in vitro cell 
culture constituents (including but not limited to condrtioned medium resulting from the growth of cells in cell culture 
medium, putatively viral ry infected cells, recombinant cells, and cell components). 

45 [0068] The invention pertains to the isolation and characterization of a newly discovered isolate of HCV. J 1 , its nucle- 
otide sequence, its protein sequence and resulting polynucleotides and polypeptides and antibodies derived therefrom. 
However, the isolation and characterization of another newly discovered isolate of HCV, J7, is also described herein, 
although it does not form part of the invention claimed. Isolates J1 and J7 are novel in their nucleotide and amino acid 
sequences, and are believed to be characteristic of HCV isolates from Japan and other Asian countries. 

so [0069] The nucleotide sequences derived from HCVJ1 are useful as probes to dagnose the presence of vims in sam- 
ples, and to isolate other naturally occurring variants of the virus. These nucleotide sequences also make available 
polypeptide sequences of HCV antigens encoded within the J1 genome and permits the production of polypeptides 
which are useful as standards or reagents in diagnostic tests and/or as components or vaccines. Antfoodies. both pol- 
yclonal and monoclonal, directed against HCV epitodes contained within these polypeptide sequences are also useful 

55 for diagnostic tests, as therapeutic agents, for screening of antiviral agents, and for the isolation of the NAN8H virus. In 
addition, by utilizing probes derived from the sequences disclosed herein it is possible to isolate and sequence other 
portions of the J1 genome, thus giving rise to addrtionaJ probes and polypeptides which are useful in the diagnosis 
and/or treatment both prophylactic and therapeutic, of NANBH. 
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[0070] The availability of the HCV/J1 nucleotide sequences enable the construction of polynucleotide probes and 
polypeptides useful in diagnosing NANBH due to HCV infection and in screening blood donors as well as donated blood 
and blood products for infection. For example, from the sequences it is possHe to synthesize DNA oligomers of about 
8-10 nucleotides, or larger, which are useful as hybridization probes to detect the presence of HCV RNA in, tor exarrple, 

5 sera of subjects suspected of harboring the virus, or tor screening donated blood tor the presence of the virus. The 
HCV/J1 sequences also allow the design and production of HCV specific polypeptides which are useful as diagnostic 
reagents for the presence of antibodies raised during NANBH. Antibodies to purified polypeptides derived from the 
HCV/J1 sequences may also be used to detect viral antigens in infected individuals and in blood. 
[0071] Knowledge of these HCV/J1 sequences also enable she design and production of polypeptides which may be 

io used as vaccines against HCV and also for the production of antibodies, which in turn may be used for protection 
against the disease, and/or tor therapy of HCV infected individuals. Moreover, the disclosed HCV/J1 sequences enable 
further characterization of the HCV genome. Polynucleotide probes derived from these sequences, as well as from the 
HCV genome, may be used to screen cONA ibraries for additional viral cONA sequences, which, in turn, may be used 
to obtain additional overlapping sequences. See. e.g.. EPO Pub. Na 318,216. 

is [0072] The HCV/J1 polynucleotide sequences, the polypeptides derived therefrom and the antibodies directed 
against these polypeptides, are useful in the isolation and identification of the BB-NANBV agentfs). For example, anti- 
bodies directed against HCV epitopes contained in polypeptides derived from the HCV/J1 sequences may be used in 
processes based upon affinity chromatography to isolate the virus. Alternatively, the antibocfies may be used to identify 
viral particles isolated by other techniques. The viral antigens and the genomic material within the isolated viral particles 

20 may then be further characterized. 

[007?] The information obtained from further sequencing of the HCV/J1 genome, as well as from further characteri- 
zation of the HCV/J1 antigens and characterization of the genomes enable the design and synthesis of additional 
probes and polypeptides and antibocfies which may be used tor diagnosis, for prevention, and for therapy of HCV 
induced NANBH, and for screening for infected blood and blood-related products. 

25 [0074] The availability of HCV/J 1 cDNA sequences permits the construction of expression vectors encoding antigen- 
icalty active regions of the polypeptide encoded in either strand. These antigenically active regions may be derived from 
coat or envelope antigens or from core antigens, or from antigens which are non-structural including, for example, poly- 
nucleotide binding proteins, polynucleotide polymerase^), and other viral proteins required for the replication and/or 
assembly of the virus partida Fragments encocfing the desired polypeptides are derived from the cONA clones using 

30 conventional restriction digestion or by synthetic methods, and are ligated into vectors which may, for example, contain 
portions of fusion sequences such as beta-galactosida&e or superoxide dismutase (SOD). Methods and vectors which 
are useful for the production of polypeptides which contain fusion sequences of SOD are described in EPO Pub. No. 
196,056. Vectors encoding fusion polypeptides of SOD and HCV polypeptides are described in EPO Pub. No. 318,216. 
Any desired portion of the HCV cDNA containing an open reading frame, in either sense strand, can be obtained as a 

35 recombinant polypeptide, such as a mature or fusion protein. Alternatively, a polypeptide encoded in the cDNA can be 
provided by chemical synthesis. 

[0075] The DNA encoding the desired polypeptide, whether in fused or mature form, and whether or not containing a 
signal sequence to permit secretion, may be ligated into expression vectors suitable for any convenient host Both 
eukaryotic and prokaryotic host systems are presently used in forming recombinant polypeptides, and a summary of 

40 some of the more common control systems and host cell is given below. The polypeptide produced in such host cells 
is then isolated from lysed cells or from the culture medium and purified to the extent needed tor its intended use. Puri- 
fication may be by techniques known in the art, tor example, differential extraction, salt fractionation, chromatography 
on ion exchange resins, affinity chromatography, centrifugation, and the like. See, tor example, Methods in Enzvmoloov 
for a variety of methods for purifying proteins. 

45 [0076] Such recombinant or synthetic HCV polypeptides can be used as diagnostics, or those which give rise to neu- 
tralizing antfoodies may be formulated into vaccines. Antibodies raised against these polypeptides can also be used as 
diagnostics, or for passive immunotherapy. In addition, antibodies to these polypeptides are useful for isolating and 
identifying HCV particles. 

[0077] The HCV antigens may also be isolated from HCV virions. The virions may be grown in HCV infected cells in 

so tissue culture, or in an infected host 

[0078] While the polypeptides of the present invention may comprise a substantially complete viral domain, in many 
applications all that is required is that the polypeptide comprise an antigenic or immunogenic region of the virus. An 
antigenic region of a polypeptide is generally relatively smaJMypically 8 to 10 amino acids or less in length. Fragments 
of as few as 5 amino acids may characterize an antigenic region. These segments may correspond to regions of 

55 HCV/J 1 epitopes. Accordingly, using the cDNAs of HCV/J1 as a basis. DNAs encoding short segments of HCV/J 1 
polypeptides can be expressed recombinant either as fusion proteins, or as isolated polypeptides. In adcfition, short 
amino acid sequences can be conveniently obtained by chemical synthesis. 

[0079] In instances wherein the synthesized polypeptide is correctly configured so as to provide the correct epitope. 
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but is too small to be immunogenic, the polypeptide may be linked to a suitable earner. A number of techniques for 
obtaining such linkage are known in the art, including the formation of disulfide linkages using N-succinimidyi-3-(2-pyri- 
dyl-thio)propionate (SPDP) and succinimidyl 4-(N-rnaleimido-methy0cyctohe^ (SMCC) obtained 

from Pierce Company, Rocktord, Illinois, frf the peptide lacks a sulfhydryl group, this can be provided by addition of a 

5 cysteine residua) These reagents create a disulfide linkage between themselves and peptide cysteine residues on one 
protein and an amide linkage through the epsilon-amino on a lysine, or other free amino group in the other. A variety of 
such disutfide/aiTiide-forming agents are known. See, for example, Immun. Rev. (1982) $g:185. Other Afunctional cou- 
pling agents form a thioether rather than a disulfide linkage. Many of these thio-ether-forming agents are commercially 
available and include reactive esters of 6-maJeimidocaproic acid, 2-bromoacetic acid, 2-iodoacetic acid, 4-(N-maleim- 

w ido-methyOcyclohexane-1 -carbcxylic acid, and the like. The carboxyl groups can be activated by combining them with 
sucdrtimide or 1-hydroxyl-2-nitro-4-sulfcnic acid, sodium salt Additional methods of coupling antigens employs the 
rotavirusfbincfing peptide" system described in EPO Pub. No. 259.149, the disclosure of which is incorporated herein 
by reference The foregoing list is not meant to be exhaustive, and modifications of the named compounds can clearly 
be used. 

is [0080] Any carrier may be used which does not itself induce the production of antibodies harmful to the host Suitable 
carriers are typically large, slowly metabolized macromolecules such as proteins; polysaccharides, such as latex func- 
tionalized sepharose, agarose, cellulose, cellulose beads and the like; polymeric amino acids, such as polyglutamic 
acid, polylysine, and the like; amino acid copolymers; and inactive virus particles. Especially useful protein substrates 
are serum aJbumins, keyhole limpet hemocyanin, immunoglobulin molecules, thyroglobulin, ovalbumin, tetanus toxoid. 

20 and other proteins well known to those skilled in the art. 

[0061 ] In addrtion to full-length viral proteins, polypeptides comprising truncated HCV amino acid sequences encod- 
ing at least one viral epitope are useful immunological reagents. For example, polypeptides comprising such truncated 
sequences can be used as reagents in an immunoassay. These polypeptides also are candidate subunit antigens in 
compositions for antiserum production or vaccines. While these truncated sequences can be produced by various 

25 known treatments of native viral protein, it is generally preferred to make synthetic or recombinant polypeptides com- 
prising an HCV sequenca Polypeptides comprising these truncated HCV sequences can be made up entirely of HCV 
sequences (one or more epitopes, either contiguous or noncontiguous), or HCV sequences and heterologous 
sequences in a fusion protein. Useful heterologous sequences include sequences that provide for secretion from a 
recombinant host, enhance the immunological reactivity of the HCV eprtope(s), or facilitate the coupling of the porypep- 

30 tide to an immunoassay support or a vaccine carrier. See, e.g.. EPO Pub. No. 1 1 6.201 ; U.S. Pat. No. 4,722,840; EPO 
Pub. No. 259, 1 49; U.S. Pat No. 4.629,783, the disclosures of which are incorporated herein by reference 
[0062] The size of polypeptides comprising the truncated HCV sequences can vary widely, the minimum size being 
a sequence of sufficient size to provide an HCV epitope, while the maximum size is not critical. In some applications, 
the maximum size usually is not substantially greater than that required to provide the desired HCV epitopes and func- 

35 tion(s) of the heterologous sequence, if any. Typically, the truncated HCV amino acid sequence will range from about 5 
to about 100 amino acids in length. More typically, however, the HCV sequence will be a maximum of about 50 amino 
acids in length, preferably a maximum of about 30 amino acids. It is usually desirable to select HCV sequences of at 
least about 10, 12 or 15 amino acids, up to a maximum of about 20 or 25 amino acids. 

[0083] Truncated HCV amino acid sequences comprising epitopes can be identified in a number of ways. For axam- 
40 pie, the entire viral protein sequence can be screened by preparing a series of short peptides that together span the 
entire protein sequenca By starting with, for example, 100-mer polypeptides, it would be routine to test each polypep- 
tide for the presence of epitope(s) showing a desired reactivity, and then testing progressively smaller and overlapping 
fragments from an identified 100-mer to map the epitope of interest. Screening such peptides in an immunoassay is 
within the skill of the art It is also known to carry out a computer analysis of a protein sequence to identify potential 
45 epitopes, and then prepare oligopeptides comprising the identified regions for screening. It is appreciated by those of 
skill in the art that such computer analysis of antigenicity does not always identify an epitope that actually exists, and 
can also incorrectly identify a region of the protein as containing an epitope. 

[0064] The observed relationship of the putative pofyproteins of HCV and the Raviviruses allows a prediction of the 
putative domains of the HCV "non-structuraJ" (NS) proteins. The locations of the individual NS proteins in the putative 

so Ravivirus precursor polyprotein are fairly well-known. Moreover, these also coincide with observed gross fluctuations in 
the hydrophobicity proffle of the polyprotein. It is established that NS5 of Raviviruses encodes the virion polymerase, 
and that NSI corresponds with a complement fixation antigen which has been shown to be an effective vaccine in ani- 
mals. Recently, it has been shown that a flavKriraJ protease function resides in NS3. Due to the observed similarities 
between HCV and the Raviviruses, deductions concerning the approximate locations of the corresponding protein 

55 domains and functions in the HCV polyprotein are possible. Rgure 1 1 is a schematic of putative domains of the HCV 
polyprotein. The expression of polypeptides containing these domains in a variety of recombinant host cells, including, 
for example, bacteria, yeast, insect and vertebrate cells, should give rise to important immunological reagents which 
can be used for diagnosis, detection, and vaccines. 
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[0085] Although the non-structural protein region of the putative polyproteins of the HCV isolate described herein and 
of Ravi viruses appears to be generally simitar, there is less similarity between the putative structural regions which are 
towards the N-terminus. In this region, there is a greater divergence in sequence, and in addition, the hydrophobic pro- 
file of the two regions shew less similarity. This "divergence" begins in the N-terminal region of the putative NS1 domain 

5 in HCV, and extends to the presumed N-terminus. Nevertheless, it is still possible to predict the approximate locations 
of the putative nudeocapskj (N-terminal basic domain) and E (generally hydrophobic) domains within the HCV poiypro- 
tein. From these predictions it may be possible to identify approximate regions of the HCV polyprotein that could corre- 
spond with useful immunological reagents. For example, the E and NS1 proteins of Flaviviruses are known to have 
efficacy as protective vaccines. These regions, as well as some which are shown to be antigenic in the HCV1 , tor axam- 

10 pie those within putative NS3, C. and NS5, etc, should also provide diagnostic reagents. 

[0066] The immunogenicrty of the HCV sequences may also be enhanced by preparing the sequences fused to or 
assembled with particle-forming proteins such as, for example, hepatitis B surface antigen or rotavirus VP6 antigen. 
Constructs wherein the HCV epitope is linked directly to the partide-forming protein coding sequences produce hybrids 
which are immunogenic with respect to the HCV epitope. In adtition, all of the vectors prepared include epitopes spa- 
rs crfic to HBV, having various degrees of imrnunogenicity, such as, for example, the pre-S peptida Thus, particles con- 
structed from particle forming protein which include HCV sequences are immunogenic with respect to HCV and 
particle-form protein. See, e.g., U.S. Pat. No. 4,722,840; EPO Pub Na 175,261; EPO Pub. Na 259,149; MicheOe etal. 
(1984) Int Symposium on Viral Hepatitis. 

[0087] Vaccines may be prepared from one or more immunogenic polypeptides derived from HCV/J1. The observed 
20 homology between HCV and Flaviviruses provides information concerning the polypeptides which are likely to be most 
effective as vaccines, as well as the regions of the genome in which they are encoded. The general structure of the Fla- 
vivirus genome is discussed in Rice et al. (1986) in THE VIRUSES: THE TOGAVIRIDAE AND FLAVIVIRIDAE (Series 
eds. Fraenkel-Conrat and Wagner, Vol eds. Schlesinger and Schlesinger, Plenum Press). The flavrvirus genomic RNA 
is believed to be the onfy virus-specific mRNA species, and it is translated into the three viral structural proteins, i.e., C. - 
25 M, and E, as well as two large nonstructural proteins, NV4 and NV5, and a complex set of smaller nonstructural pro- 
teins, ft is known that major neutralizing epitopes tor Flaviviruses reside in the E (envelope) protein. Roehrig (1986) in 
THE VIRUSES: THE TOGAVIRIDAE AND FLAVIVIRIDAE (Series eds. Fraenkel-Conrat and Wagner, Vol eds. Schles- 
inger and Schlesinger, Plenum Press). The corresponding HCV E gene and polypeptide encoding region may be pre- 
dicted, based upon the homology to Flaviviruses. Thus, vaccines may be comprised of recombinant polypeptides 
30 containing epitopes of HCV E. These polypeptides may be expressed in bacteria, yeast or mammalian cells, or alter- 
natively may be isolated from viral preparations, ft is also anticipated that the other structural proteins may also contain 
epitopes which give rise to protective anti-HCV antibodies. Thus, polypeptides containing the epitopes of E, C, and M 
may also be used, whether singly or in combination, in HCV vaccines. 

[0068] In addition to the above, it has been shown that immunization with NS1 (nonstructural protein 1), results in pro- 
35 tection against yellow fwer. Schlesinger et al (1986) J. Virol. fiQ:1153. This is true even thoutfi the immunization does 
not give rise to neutralizing antibodies. Thus, particularly since this protein appears to be highly conserved among Fla- 
viviruses, it is likely that HCV NS1 will also be protective against HCV infection. Moreover, it also shows that nonstruc- 
tural proteins may provide protection against viral pathogenicity, even if they do not cause the production of neutralizing 
antibodies. 

40 [0089] In view of the above, multivalent vaccines against HCV may be comprised of one or more epitopes from one 
or more structural proteins, and/or one or more epitopes from one or more nonstructural proteins. These vaccines may 
be comprised of. for example, recombinant HCV polypeptides and/or polypeptides isolated from the virions. In particu- 
lar, vaccines are contemplated comprising one or more of the following HCV proteins, or subunrt antigens derived there- 
from: E, NS1 , C, NS2, NS3, NS4 and NS5. Particularly preferred are vaccines comprising E and/or NS1 , or subunrts 

45 thereof. In addition, it may be possible to use inactivated HCV in vaccines; inactivation may be by the preparation of viral 
lysates, or by other means known in the art to cause inactivation of Flaviviruses, for example, treatment with organic 
solvents or detergents, or treatment with formalin. Moreover, vaccines may also be prepared from attenuated HCV 
strains or from hybrid viruses such as vaccinia vectors toown in the art [Brown et a). Nature 319 : 549-650 (1986)]. 
[0090] The preparation of vaccines which contain irnmunogenic polypeptide^) as active ingredients is taown to one 

so skilled in the art. Typically, such vaccines are prepared as injectabies, either as liquid solutions or suspensions; solid 
forms suitable for solution in, or suspension in, liquid prior to injection may also be prepared. The preparation may also 
be emulsified, or the protein encapsulated in liposomes. The active immunogenic ingredients are often mixed with 
excipients which are pharmaceutical^ acceptable and compatible with the active ingredient. Suitable excipients are, for 
example, water, saline, dextrose, glycerol, ethanol, or the like and combinations thereof. In addition, if desired, the vac- 

55 cine may contain minor amounts of auxiliary substances such as wetting or emulsifying agents. pH buffering agents, 
and/or adjuvants which enhance the effectiveness of the vaccine. Examples of adjuvants which may be effective include 
but are not limited to: aluminum hydroxide, N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-nor- 
niuramyl-L-alanyl-D-isoglutamine (CQP 1 1637, referred to as nor-MDP), N-acetylrruramyl-L-alanyl-D-isoglutaminyl-L- 
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alanine-2-(V-2'<fipalmrtoyl^^Jycerd-3-hydroxyphosph (CQP 19835A, referred to as MTP-PE). 

and RIBI, which contains three components extracted from bacteria, monophosphoryt lipid A, trehalose dimycolate and 
cell wall skeleton (MPL+TDM+CWS) in a 2% squalene/Tween 80 emulsion. The effectiveness of an adjuvant may be 
determined by measuring the amount of antfoodies directed against an immunogenic polypeptide containing an HCV 
5 antigenic sequence resulting from administration of this polypeptide in vaccines which are also comprised of the various 
adjuvants. 

[0091 ] The vacdne6 are conventionally administered parenteralfy, by injection, usually, either subcutaneousiy or intra- 
muscularly. Additional formulations which are suitable for other modes of administration include suppositories and, in 
some cases, oral formulations. For suppositories, traditional binders and carriers may include, for example, polyalkylene 

10 glycols or triglycerides; such suppositories may be formed from mixtures containing the active ingredient in the range 
of 0.5% to 10%, preferably 1%-2%. Oral formulations include such normally employed exciptents as, for example, phar- 
maceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbon- 
ate, and the like. These compositions take the form of solutions, suspensions, tablets, pills, capsules, sustained release 
formulations or powders and contain 10%-95% of active ingredient preferably 25%-70%. 

15 [0092] The proteins may be formulated into the vaccine as neutral a salt forms. Pharmaceutical^ acceptable salts 
include the acid addition salts (formed with free amino groups of the peptide) and which are formed with inorganic acids 
such as, for example, hydrochloric or phosphoric acids, or such organic acids such as acetic oxalic, tartaric maleic, 
and the like. Salts formed with the free carboxyl groups may also be derived from inorganic bases such as, for example, 
sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylarnine, trimethyl- 

20 amine, 2-ethylarrino ethanol, histidine, procaine, and the like. 

[0093] The vaccines are administered in a manner compatible with the dosage formulation, and in such amount as 
will be prophyf actually and/or therapeutically effective. The quantity to be administered, which is generally in the range 
of 5 micrograms to 250 micrograms of antigen per dcse, depends on the subject to be treated, capacity of the subject's 
immune system to synthesize antibodies, and the degree of protection desired. Precise amounts of active ingredient 

25 required to be administered may depend on the judgment of the practitioner and may be peculiar to each subject 
[0094] The vaccine may be given in a single dose schedule, or preferably in a multiple dose schedula A multiple dose 
schedule is one in which a primary course of vaccination may be with 1-10 separate doses, followed by other doses 
given at subsequent time intervals required to maintain and or reenforce the immune response, for example, at 1-4 
months for a second dose, and if needed, a subsequent dose(s) after several months. The dosage regimen will also, at 

30 least in part, be determined by the need of the individual and be dependent upon the judgment of the practitioner. 
[0095] In addition, the vaccine containing the imrminogenic HCV antigen(s) may be administered in conjunction with 
other immunoregulatory agents, for example, immune globulins. 

[0096] The immunogenic polypeptides prepared as described above are used to produce antfoodies, both polyclonal 
and monoclonal. If polyclonal antibodies are desired, a selected mammal (e.g., mouse, rabbit, goat, horse, etc) is 

3$ immunized with an immunogenic polypeptide bearing an HCV epitope(s). Serum from the immunized animal is col- 
lected and treated according to known procedures. If serum containing polyclonal antibodies to an HCV epitope con- 
tains antibodies to other antigens, the polyclonal antibodies can be purified by immunoafftnity chromatography. 
Techniques for producing and processing polyclonal antisera are known in the art, see for example, Mayer and Walker, 
eds. (1987) IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY (Academic Press. London). 

40 [0097] Monoclonal antibodies directed against HCV epitopes can also be readily produced by one skilled in the art 
The general methodology for making monoclonal antibodies by hybridomas is well known. Immortal antibody-producing 
cell lines can be created by cell fusion, and also by other techniques such as direct transformation of B lyrnphocytes 
with oncogenic DNA, or transfection with Epstein- Ban* virus. See, e.g., M. Schreier et al. (1980) HYBRIDOMA TECH- 
NIQUES; Hammerling et al. (1981), MONOCLONAL ANTIBODIES AND T-CELL HYBRIDOMAS; Kennett et al. (1980) 

45 MONOCLONAL ANTIBODIES; Sfifl SlSSL U.S. Patent Nos. 4,341,761; 4,399,121; 4,427,783; 4,444.887; 4,466,917; 
4,472.500; 4,491,632; and 4,493,890. Panels of monoclonal antfoodies produced against HCV epitopes can be 
screened for various properties; La, for isotype, epitope affinity, eta 

[0098] Antibodies, both monoclonal and polyclonal, which are directed against HCV epitopes are particularly useful 
in diagnosis, and those which are neutralizing are useful in passive irnmunotherapy. Monoclonal antfoodies, in particu- 

50 lar, may be used to raise antiidiotype antibodies. 

[0099] Anti-idiotype antibodies are immunoglobulins which carry an Internal image" of the antigen of the infectious 
agent against which protection is desired. Techniques for raising anti-kfiotype antibodes are known in the art See, e,g„ 
Grzych (1985). Nature £1£:74; MacNamara et al. (1984), Science gg.1325, Uytdehaag et al (1985), J. Immunol. 
121:1225. These anti-idiotype antibodies may also be useful for treatment and/or diagnosis of NANBH, as well as for 

55 an elucidation of the immunogenic regions of HCV antigens. 

[0100] Using the HCV/J1 polynucleotide sequences as a basis, oligomers of approximately 8 nucleotides or more can 
be prepared, either by excision or synthetically, which hybridize with the HCV genome and are useful in identification of 
the viral agent(s). further characterization of the viral genome(s), as well as in detection of the virus(es) in diseased indi- 
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viduate. The probes for HCV polynucleotides (natural or derived) are a length which allows the detection of unique viral 
sequences by hybridization. While 6-8 nucleotides may be a workable length, sequences of about 10-12 nucleotides 
are preferred, and about 20 nucleotides appears optimal. These probes can be prepared using routine methods, includ- 
ing automated oligonucleotide synthetic methods. Among useful probes, for example, are the clones cfisciosed herein, 
5 as well as the various oligomers useful in probing cDNA Itoraries, set forth below. A complement to any unique portion 
of the HCV genome will be satisfactory. For use as probes, complete cornplementarity is desirable, though it may be 
unnecessary as the length of the fragment ts increased. 

[0101] For use of such probes as diagnostics, the biological sample to be analyzed, such as blood or serum, may be 
treated, if desired, to extract the nucleic acids contained therein. The resulting nucleic acid from the sample may be sub- 

10 jected to gel electrophoresis or other size separation techniques; alternatively, the nucleic acid sample may be dot blot- 
ted without size separation. The probes are then labeled. Suitable labels, and methods for labeling probes are known 
in the art. and include, for example, radioactive labels incorporated by nick translation or kinasing, biotin, fluorescent 
probes, and cherniluminescent probes. The nucleic acids extracted from the sample are then treated with the labeled 
probe under hybridization conditions of suitable stringencies. Usually high stringency conditions are desirable in order 

is to prevent false positives. The stringency of hybridization is determined by a number of factors during hybridization and 
during the washing procedure, including temperature, ionic strength, length of time, and concentration of farnamide. 
These factors are outlined in, for example, Maniatis. T. (1982) MOLECULAR CLONING; A LABORATORY MANUAL 
(Cold Spring Harbor Press, Cold Spring Harbor, N.Y.). 

[0102] Generally, it is expected that the HCV genome sequences will be present in serum of infected individuals at 

20 relatively low levels, i.e., at approximately lO^-IO 3 chimp infectious doses (CID) per ml. This level may require that 
amplification techniques be used in hybridization assays. Such techniques are known in the art For example. theEnzo 
Biochemical Corporation "Bio-Bridge" system uses terminal deoxynucleotide transferase to add unmodified 3'-pofy-dT- 
tails to a DNA probe. The poly dT-tailed probe is hybridized to the target nucleotide sequence, and then to a biotm-mod- 
rfied pdy-A. PCT App. No, 84/03520 and EPO Pub. Na 124,221 describe a DNA hybridization assay in which: (1) ana- 

25 lyte is annealed to a single-stranded DNA probe that is complementary to an enzyme-labeled oligonucleotide; and (2) 
the resulting tailed duplex is hybridized to an enzyme-labeled oligonudeotida EPO Pub. Na 204,510 describes a DNA 
hybridization assay in which analyte DNA is contacted with a probe that has a tail, such as a pory-dT tail, an amplifier 
strand that has a sequence that hybridizes to the tail of the probe, such as a poly- A sequence, and which is capable of 
binding a plurality of labeled strands. 

30 [01 03] A particularly desirable technique may first involve amplification of the target HCV sequences in sera approx- 
imately 10,000-fold. i.e.. to approximately 10 6 sequences/ml. This may be accomplished, for example, by the polymer- 
ase chain reactions (PCR) technique described which is by Saikj et al. (1986) Nature 324:163. MuJlis, US. Patent No. 
4,683,195, and MuJlis et al. U.S. Patent No. 4,683,202. The amplified sequences) may then be detected using a hybrid- 
ization assay which is described in co-pending European Publication Na 317*077 and Japanese application Na 63- 

35 260347. which are assigned to the herein assignee, and are hereby incorporated herein by reference. These hybridiza- 
tion assays, which should detect sequences at the level of 10 6 /ml, utilize nucleic acid muHimers which bind to single- 
stranded analyte nucleic acid, and which also bind to a multipticity of single-stranded labeled oligonucleotides. A suita- 
ble solution phase sandwich assay which may be used with labeled polynucleotide probes, and the methods for the 
preparation of probes is described in EPO Pub. No. 225,807 which is hereby incorporated herein by reference. 

40 [0104] The probes can be packaged into diagnostic kits. Diagnostic kits include the probe DNA, which may be labeled; 
alternatively, the probe DNA may be unlabeled and the ingredients for labeling may be included in the kit in separate 
containers. The kit may also contain other suitably packaged reagents and materials needed for the particular hybridi- 
zation protocol, for example, standards, wash buffers, as well as instructions for conducting the test 
[0105] Both the HCV/J1 polypeptides which react imrnunotogicalry with serum containing HCV antibodies and the 

45 antibodies raised against the HCV specific epitopes in these polypeptides are useful in immunoassays to detect pres- 
ence of HCV antfcodies. or the presence of the virus and/or viral antigens, in biological samples. Design of the immu- 
noassays is subject to a great deal of variation, and a variety of these are known in the art An immunoassay for anti- 
HCV antibody may utilize one viral epitope or several viral epitopes. When multiple epitopes are used, the epitopes may 
be derived from the same or Afferent viral polypeptides, and may be in separate recombinant or natural polypeptides, 

so or together in the same recombinant polypeptides. 

[0106] An immunoassay for viral antigen may use, for example, a monoclonal antfoody directed towards a viral 
epitope, a combination of monoclonal antibodies directed towards epitopes of one viral polypeptide, monoclonal anti- 
bodies directed towards epitopes of different viral polypeptides, polyclonal antibodies directed towards the same viral 
antigen, polyclonal antiboties directed towards different viral antigens or a combination of monoclonal and polyclonal 

55 antibodies. 

[0107] Immunoassay protocols may be based, for example, upon competition, or direct reaction, or sandwich type 
assays. Protocols may also, tor example, use solid supports, or may be by irrtrrwrwpreciprtation. Most assays involve 
the use of labeled antibody or polypeptide. The labels may be, for example, fluorescent cherniluminescent, radioactive, 
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or dye molecules. Assays which amplify the signals from the probe are also known. Examples of which are assays 
which utilize bkrtin and avidin, and enzyme-labeled and mediated immunoassays, such as EUSA assays. 
[0108] Typically, an immunoassay for anti-HCV antibody will involve selecting and preparing the test sample, such as 
a biological sample, and then incubating it with an antigenic (i.e., epitope-containing) HCV polypeptide under conditions 

5 that allow antigen-antibody complexes to form. Such conditions are well known in the art In a heterogeneous format, 
the polypeptide is bound to a solid support to facilitate separation of the sample from the polypeptide after incubation. 
Examples of solid supports that can be used are nitrocellulose, in membrane or microliter well form, polyvinylchforide. 
in sheets or microliter wells, polystyrene latex, in beads or microtiter plates, polyvinylidine fluoride, known as Immobu- 
lon™, diazotized paper, nylon membranes, activated beads, and Protein A beads. Most preferably, the Dynatech, Irrtmu- 

10 Ion™ 1 microtiter plate or the 0.25-inch polystyrene beads, which Spec finished by Precision Plastic Ball, are used in 
the heterogeneous formal The solid support is typically washed after separating it from the test sample. In a homoge- 
neous format the test sample is incubated with antigen in solution, under conditions that will precipitate any antigen- 
antibody complexes that are formed, as is know in the art The precipitated complexes are then separated from the test 
sample, for example, by centrifugation. The complexes formed comprising anti-HCV antfoody are then detected by any 

is of a number of techniques. Depending on the format the complexes can be detected with labeled anti-xenogeneic Ig 
or, if a competitive format is used, by measuring the amount of bound, labeled competing antibody. 
[0109] In immunoassays where HCV polypeptides are the analyte, the test sample, typically a biological sample, is 
incubated with anti-HCV antibodies again under conditions that allow the formation of antigen-antfoody complexes. Var- 
ious formats can be employed, such as a "sandwich" assay where anttoody bound to a solid support is incubated with 

20 the test sample; washed;incubated with a second, labeled antibody to the analyte; and the support is washed again. 
Analyte is detected by determining if the second antibody is bound to the support In a c om pe titi ve format which can 
be either heterogeneous or homogeneous, a test sample is usually incubated with and antibody and a labeled, compet- 
ing antigen either sequentially or simultaneously. These and other formats are well known in the art. 
[01 10] The Flavivirus model for HCV allows predictions regarcfing the likely location of diagnostic epitopes for the vir- 

25 ion structural proteins. The C, pre-M, M, and E domains are all likely to contain epitopes of significant potential for 
detecting viral antigens, and particularly for diagnosis. Similarly, domains of the rwnstructural proteins are expected to 
contain important diagnostic epitopes (e.g., NS5 encoding a putative polymerase; and NS1 encoding a putative com- 
plement-binding antigen). Recombinant polypeptides, or viral polypeptides, which include epitopes from these specific 
domains may be useful for the detection of viral antibodies in infections blood donors and infected patients. In addition, 

30 antibodies directed against the E and/or M proteins can be used in immunoassays for the detection of viral antigens in 
patients with HCV caused NANBH, and in infectious blood donors. Moreover, these antibodies may be extremely useful 
in detecting acute-phase donors and patients. 

[01 1 1 ] Antigenic regions of the putative poryprotein can be mapped and identified by screening the antigenicity of bac- 
terial expression products of HCV cDNAs which encode portions of the potyproteirt Other antigenic regions of HCV 
35 may be detected by expressing the portions of the HCV cDNAS in other expression systems, including yeast systems 
and cellular systems derived from insects and vertebrates. In addition, studies giving rise to an antigenicity index and 
riydrophobicrtymydrophilicity profile give rise to information concerning the probability of a region's antigenicity. Efficient 
detection systems may include the use of panels of epitopes. The epitopes in the panel may be constructed into one or 
multiple polypeptides. 

40 [01 1 2] Kits suitable for immunodiagnosis and containing the appropriate labeled reagents are constructed by pack- 
aging the appropriate materials, including the polypeptides of the invention containing HCV epitopes or antibodies 
directed against HCV epitopes in suitable containers, along with the remaining reagents and materials required for the 
conduct of the assay (e.g.. wash buffers, detection means like labeled anti-human Ig, labeled anti-HCV, or labeled HCV 
antigen), as well as a suitable set of assay instructions. 

45 [01 1 3] The HCV/J1 nucleotide sequence information described herein may be used to gain further information on the 
sequence of the HCV genomes, and foe identification and isolation of additional HCV isolates related to J1. This infor- 
mation, in turn, can lead to additional polynucleotide probes, polypeptides derived from the HCV genome, and antibod- 
ies directed against HCV epitopes which would be useful for the diagnosis and/or treatment of HCV caused NANBH. 
[01 1 4] The HCV/J 1 nucleotide sequence information herein is useful for the design of probes for the isolation of addi- 

so tional sequences which are derived from as yet undefined regions of the HCV genomes from which the J1 sequences 
are derived. For example, labeled probes containing a sequence of approximately 8 or more nucleotides; and preferably 
20 or more nucleotides, which are derived from regions close to the S-termini or 3*-termini of the family of HCV cDNA 
sequences disclosed in the examples may be used to isolate overlapping cDNA sequences from HCV cDNA libraries. 
These sequences which overlap the cDNAs in the above-mentioned clones, but which also contain sequences derived 

55 from regions of the genome from which the cDNA in the above mentioned clones are not derived, may then be used to 
synthesize probes for identification of other overlapping fragments which do not necessarily overlap the cDNAs 
described below. Methods for constructing cDNA libraries are known in the art See, e.g. EPO Pub. No. 318,216. It is 
particularly preferred to prepare libraries from the serum of Japanese and other Asian patients diagnosed as having 
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NANBH demonstrating anttoody to HCV1 antigens; these are believed to be the most likely candidates for carriers of 
HCV/J1 or related isolates. 

[01 1 5] HCV particles may be isolated from the sera from individuals with NANBH or from cell cultures by any of the 
methods Known in the art, including for example, techniques based on size discrimination such as sedimentation or 
5 exclusion methods, or techniques based on density such as ultracentrifugation in density gracfients, or precipitation with 
agents such as polyethylene glycol, or chromatography on a variety of materials such as anionic or cationic exchange 
materials, and materials which bind due to hydrophobicity. 

[0116] A preferred method of isolating HCV particles or antigen is by immunoaffinity columns. Techniques for immu- 
noaffinity chromatography are known in the art including techniques for affixing antibodies to solid supports so that they 

10 retain their immunoselective activity. The techniques may be those in which the antibodies are adsorbed to the support 
(see, for example, Kurstak in ENZYME IMMUNODIAGNOSIS, page 31 -37), as well as those in which the antibodies are 
ccvalently linked to the support Generally, the techniques are similar to those used for ccvalent linking of antigens to a 
solid support described abcva However, spacer groups may be included in the bifunctional coupling agents so that the 
antigen binding site of the anttoody remains accessible. The antibodies may be monoclonal, or polyclonal, and it may 

is be desirable to purify the antibodies before their use in the immunoassay. 

[0117] The general techniques used in extracting the genome from a virus, preparing and probing a cDNA Ifcrary. 
sequencing clones, constructing expression vectors, transforming cells, performing imrruinoiogicaJ assays such as radi- 
oimmunoassays and EUSA assays, for growing cells in culture, and the like are known in the art and laboratory manu- 
als are available descrfoing these techniques. However, as a general guide, the following sets forth some sources 

20 currently available for such procedures, and for materials useful in carrying them out 

[0118] Both rxokaryotic and eukaryotic host cells may be used for expression of desired coding sequences when 
appropriate control sequences which are compatible with the designated host are used. Among prokaryotic hosts, L. 
GCli is most frequently used. Expression control sequences for prokaryotes include promoters, optionally containing 
operator portkm and rfoosome binding sites. Transfer vectors cornpatM^ - 

25 from, tor example, pBR322, a piasrrid containing operons conferring ampitilGn and tetracycline resistance, and the var- 
ious pUC vectors, which also contain sequences conferring antfctotic resistance markers. These markers may be used 
to obtain successful transforrnants by selection. Commonly used prokaryotic control sequences include the Beta-Jacta- 
mase (penicillinase) and lactose promoter systems (Chang et al.(1977), Nature 198:1056. the tryptophan (trp) pro- 
moter system (Goeddel et al. (1980) Nudeic Acid Res. fi:4057), and the lambda-derived P L promoter and N gene 

30 riboGome binding site (Shimatake et al. (1981) Nature 292:128) and the hybrid tac promoter (De Boer et al. (1983) Proc. 
Natl. Acad. Sci. USA 292:128) derived from sequences of the to and lac UV5 promoters. The foregoing systems are 
particularly compatible with E. coli: if desired, other prokaryotic hosts such as strains of Bacillus or Pseudomonas may 
be used, with corresponding control sequences. 

[0119] Eukaryotic hosts include yeast and mammalian cells in culture systems. Saccharomyces cerevisiae. Saccha- 

35 romyces cartefaerqensis. Klebsiala lactis and Ptehia pastoris are the most commonly used yeast hosts, and are conven- 
ierrt fungal hosts. Yeast compatible vectors carry markers which permit selection of successful transforrnants by 
conferring pr o toli o p hy to auxotrophic mutants or resistance to heavy metals on wild-type strains. Yeast compatible vec- 
tors may employ the 2 micron origin of replication (Broach et al. (1 983) Math Enz. 101 307), the combination of CEN3 
and ARS1 or other means for assuring replication, such as sequences which will result in incorporation of an appropri- 

40 ate fragment into the host cell genome. Control sequences for yeast vectors are known in the art and include promoters 
for the synthesis of glycolytic enzymes (Hess et al. (1968) J. Adv. Enzyme Eng. £149; Holland et al. (1978). J. Biol. 
Chem. 256:1385). including the promoter for 3 phosphoglycerate kinase (Hitzeman (1980), J. Biol. Chem. 255:2073). 
Terminators may also be included, such as those derived from the enoiase gene (Holland (1981), J. Biol. Chem. 
256:1385). Particularly useful control systems are those which comprise the gryceraldehyde-3 phosphate dehydroge- 

45 nase (GAPDH) promoter or alcohol dehydrogenase (ADH) regulatabte promoter, terminators also derived from 
QAPDH, and if secretion is desired, leader sequence from yeast alpha factor. In addition, the transcriptional regulatory 
region and the transcriptional initiation region which are operabiy linked may be such that they are not naturally associ- 
ated in the wild-type organism. These systems are described in detail in EPO Pub. No 120.551 ; EPO Pub. No. 1 16,201 ; 
and EPO Pub. Na 164,556 all of which are incorporated herein by reference. 

so [0120] Mammalian cell lines available as hosts for expression are known in the art and include many immortalized 
cell lines available from the American Type Culture Collection (ATCC), including HeLa cells, Chinese hamster ovary 
(CHO) cells, baby hamster kidney (BHK) cells, and a number of other cell lines. Suitable promoters for mammalian cells 
are also known in the art and include viral promoters such as that from Simian Virus 40 (SV40) (Rers (1978), Nature 
273:113V Rous sarcoma virus (RSV), adenovirus (AOV). and bovine papilloma virus (BPV). Mammalian cells may also 

55 require terminator sequences and poly A addition sequences; enhancer sequences which increase expression may 
also be included, and sequences which cause amplification of the gene may also be desirable. These sequences are 
known in the art Vectors suitable for replication in mammalian cells may include viral replicons, or sequences which 
insure integration of the appropriate sequences encoding NANBV epitopes into the host genome. 
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[01 21 ] The vaccinia virus system can also be used to express foreign DNA in mammalian cells. To express heterolo- 
gous genes, the foreign DNA is usually inserted into the thymidine kinase gene of the vaccinia virus and then infected 
cells can be selected. This procedure is known in the art and further information can be found in these references 
[Mackett et al. J. Virol. 42: 857-864 (1984) and Chapter 7 in DNA Clonino. Vbi. 2. IRL Press]. 

5 [01 22] In addition, viral antigens can be expressed in insect celts by the BacuJovirus system. A general guide to bac- 
ulovirus expression by Summer and Smith is A Manual of Methods for Baculovirus Vectors and Insect Cell Culture Pro- 
cedures (Texas Agricultural Experiment Station Bulletin No. 1555). To incorporate the heterologous gene into the 
Baculovirus genome the gene is first cloned into a transfer vector containing some Baculovirus sequences. THis trans- 
fer vector, when it is cotransfected with wild-type virus into insect cells, will recombine with the wild-type virus. Usually, 

w the transfer vector will be engineered so that the heterologous gene will disrupt the wild-type Baculovirus polyhedron 
gena This disruption enables easy selection of the recombinant virus since the cells infected with the recombinant virus 
will appear phenotypically different from the cells infected with the wild-type virus. The purified recombinant virus can 
be used to infect cells to express the heterologous gene. The foreign protein can be secreted into the medium if a signal 
peptide is linked in frame to the heterologous gene; otherwise, the protein will be bound in the cell lysatee. For further 

15 information, see Smith et al MoT & Cell. Bid. 32156-2165 (1983) a Luckow and Summers in Virology 1Z: 31-39 (1989). 
[0123] Transfo r m ati on may be by any known method for introducing polynucleotides into a host cell, including, for 
example packaging the polynucleotide in a virus and transducing a host cell with the virus, and by direct uptake of the 
polynucleotide. The transform a tion procedure used depends upon the host to be transformed. Bacterial transfor ma tion 
by direct uptakB generally employs treatment with calcium or rubidium chloride (Cohen (1972), Proa Nat). Acad. Sci. 

20 USA £221 10; Maniatts et al. (1982). MOLECULAR CLONING; A LABORATORY MANUAL (Cold Spring Harbor Press, 
Cold Spring Harbor, N.Y). Yeast tran s format i on by direct uptake may be earned out using the method of Hirtnen et al. 
(1978) Proc. Natl. Acad. Sci. USA Z5: 1929. Mammalian tra ns formations by direct uptake may be conducted using the 
calcium phosphate precipitation method of Graham and Van der Eb (1978), Virology 52:546 or the various known mod- 
ifications thereof. 

25 [0124] Vector construction employs techniques which are known in the art Site-specific DNA cleavage is performed 
by treating with suitable restriction enzymes under conditions which generally are specified by the manufacturer of 
these commercially available enzymes. The cleaved fragments may be separated using polyacrylamide or agarose gel 
electrophoresis techniques, according to the general procedures found in Methods in Enzymotogy (1980) 65:499-560. 
Sticky ended cleavage fragments may be blunt ended using E. coH DNA polymerase I (Wencw) in the presence of the 

30 appropriate decxynudeotide triphosphates (dNTPs) present in the mixture. Treatment with S1 nuclease may also be 
used, resulting in the hydrolysis of any single stranded DNA portions. 

[0125] Ligations are carried out using standard buffer and temperature conditions using T4 DNA ligase and ATP; 
sticky end ligations require less ATP and less ligase than blunt end ligations. When vector fragments are used as part 
of a ligation mixture, the vector fragment is often treated with bacterial alkaline phosphatase (BAP) or calf intestinal 

6 alkaline phosphatase to remove the S'-phosphate and thus prevent reiigation of the vector; alternatively, restriction 
enzyme digestion of unwanted fragments can be used to prevent ligation. Ligation mixtures are transformed into suita- 
ble cloning hosts, such as cot and successful transformants selected by, for example, antibiotic resistance, and 
screened for the correct construction. 

[0126] Synthetic oligonucleotides may be prepared using an automated oligonucleotide synthesizer as descrfoed by 
40 Warner (1 984), DNA 3:401 . If desired, the synthetic strands may be labeled with M P by treatment with polynucleotide 
kinase in the presence of ^P-ATP, using standard condrtfons for the reaction. DNA sequences, including those isolated 
from cDNA Ibraries, may be modified by known techniques, including, for example site directed mutagenesis, as 
described by Zoller (1982), Nucleic Acids Res. 1&6487. 

[0127] DNA libraries may be probed using the procedure of Grunstein and Hogness (1975), Proc Natl. Acad. Sci. 

45 USA 23:3961. Briefly, in this procedure, the DNA to be probed is immobilized on nitrocellulose filters, denatured, and 
prehybridized with a buffer. The percentage of formamide in the buffer, as well as the time and temperature concftions 
of the prehybridization and subsequent hybridization steps depends on the stringency required. Oligomeric probes 
which require lower stringency conditions are generally used with low percentages of formamide, lower temperatures, 
and longer hybridization times. Probes containing more than 30 or 40 nucleotides such as those derived from cDNA or 

so genomic sequences generally employ higher temperatures, e.g., about 40-42°C, and a high percentage, e.g., 50%, for- 
mamide. Following prehybridization, S'-^P-labeled oligonucleotide probe is added to the buffer, and the f aters are incu- 
bated in this mixture under hybridization conditions. After washing, the treated filters are subjected to autoradiography 
to show the location of the hybrid zed probe; DNA in corresponding locations on the original agar plates is used as the 
source of the desired DNA. 

55 [01 28] An enzyme-finked immunosorbent assay (EUSA) can be used to measure either antigen or antibody concen- 
trations. This method depends upon conjugation 

enzyme activity as a quantitative label. To measure antibody, the known antigen is fixed to a solid phase (e.g., a micro- 
plate or plastic cup), incubated with test serum dilutions, washed, incubated with anti-immunoglobulin labeled with an 
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enzyme, and washed again. Enzymes suitable for labeling are known in the art, and include, for example, horseradish 
peroxidase. Enzyme activity bound to the solid phase is measured by adding the specific substrate, and determining 
product formation or substrate utilization colorimetrically. The enzyme activity bound is a direct function of the amount 
of antibody bound. 

[0129] To measure antigen, a known specific antibody is fixed to the solid phase, the test material containing antigen 
is added, after an incubation the solid phase is washed, and a second enzyme-tabeled antibody is added. After wash- 
ing, substrate is added, and enzyme activity is estimated colorimetrically, and related to antigen concentration. 

Examples 

I 

[01 30] This example descrbes the doning of the HCV/J1 and HCV/J7 nucleotide sequences. 
[0131] Both blood samples which were used as a source of HCV virions were found to be positive in an anti-HCV 
antibody assay. THe HCV isolates from these samples were named HCV/J1 and HCV/J7. The inf activity of the blood 
sarrple containing the J1 isolate was confirmed by a prospective study of blood transfusion recipients. Dr. Tohru 
Katayamafrom the Department of Surgery at the National Tokyo Chest Hospital collected blood from patients who have 
contracted post-transfusion non-A, non-B hepatitis. He also collected blood samples from the respective blood donors 
of these patients. Next, these samples were assayed for antibodies to the C100-3 HCV1 antigen (EPO Pub, No. 
318,216). and blood from one of the donors was found to be positive. 

[0132] Isolation of the RNA from the blood samples began by pelleting virions in the blood sample by uttracentrifuga- 
tion [Bradley, D.W.. McCaustland, KA, Cook E.H., Schable, C.A.. Bbert, J.W. and Maynard. J.E. (1985) Gastroenterol- 
ogy £L 773-779]. RNA was then extracted from the pellet by the guarudiniurrvcesium chloride method [Maniatis T, 
Frttsch, E.F.. and SambrookJ. (1982) "Molecular Cloning: A Laboratory Manual". Cold Spring Harbor Laboratory, Cold 
Spring Haiter] and further purified by phenol/chtoroform extraction in the presence of urea, [Berk, AJ. Lee.F, Harrison, 
T, Williams, J. and Sharp, PA. (1979) Cell 17. 935-944]. 

[0133] Five pairs of synthetic oligonucleotide primers were designed from the C/E, E, E/NS1 , NS3, and NS5 domains 
of the nucleotide sequence of HCV1 to isolate fragments from the J1 and J7 genome. The first set of primers were to 
isolate the sequence from the core and some of the envelope domain. The second set of primers were to isolate the 
sequences in the envelope domain. The third set of primers were to isolate a fragment which overlapped the putative 
envelope and non-structural one, NS1 , domains. The fourth and fifth set of primers were used to isolate fragments from 
non-structural domains three and five, NS3 and NS5. The sequences for the various primers are shown below: 
The sequence of the primers for the C/E region were: 

21 S 5' CQTQCCCCCQCAAQACTQCT 3' 
J80A 5' CCGTCCTCCAGAACCCGQAC 3' 

The sequence of the primers for the E region were: 

71 S 5' GCCGACCTCATGGGQTACAT 3' 
J132A 5' AACTGCGACACCACTAAGGC 3' 

The sequence of the primers for the E/NS1 region were: 

1 27S 5' TG^ATQQQATATQATQATQ 3' 
1 66A 5' TTQAACTTGrrQQTQATAGAA 3' 

The sequence of the primers for the NS3 region were : 

464S 5' GGCTATACCGGCGACTTCGA 3' 
526A 5' GAGATGCATGTCATGATGTA 3' 

The sequence of the primers for the NS5 region were: 

870S 5' GCTGGAAAGAGGGTCTACTA 3' 
91 7A 5' QTTCTTACTQCCCAGTTQAA 3' 

[0134] 1 jig of the antrsense primers. 166A, 526A, or 917A, was added to 10 units of reverse transcriptase (Biorad) 
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to synthesize cDNA fragments from the isolated RNA as the template. The cDNA fragments were then amplified by a 
standard polymerase chain reaction [Saiki, R.K., Scharf, S., Faloona, F., Mullis, K.B., Horn G.T., Erlich, HA, and Arn- 
heim, N. (1985) Science 220* 1350-1354] after 1 ng of the appropriate sense primer. 21 S, 71 S. 127S. 464S or 870S, 
was added. 

5 [01 35] The cDNA fragments amplified by the PCR method were gel isolated and cloned by blunt-end ligation into the 
Smal site of pUC1 19 fVieira, J. and Messing, J. (1987) Methods in Enzymology 152, 3-1 1] or into the SnaBI site of cha- 
romid SB. a derivative of the cloning vector charomid 9-42 [Saita, I. and Stark, G. (1986) Proa Natl. Acad. Set. USA & 
8664-8668]. Clones which contain the fragments of the five viral domains were successfully constructed. 

w U 

[01 36] From the PCR reaction of the Japanese isolates, J 1 and J7, three independent clones from each region, C/E, 

E, E/NS1, NS3, and NS5, have been sequenced by the dideaxy chain termination method. 

[0137] Sequence from all regions except C/E has been isolated from the J1 isolate. Sequence from only the C/E 

is region has been isolated from the J7 isolate. Surprisingly, fragments isolated from both isolates are neither longer or 
shorter than what would be predicted from the HCV1 genome. However, there is heterogeneity between clones con- 
taining sequence from the same region. Consequently, a consensus sequence was constructed for each of the 
domains, C/E, E, E/NS1, NS3 and NS5, as shewn respectively in Figures 1 through 5. These differences may be 
explained as artifacts which occur randomly during the PCR amplification [SaiM, R.K., Scharf, S., Faloona. F. t Mullis, 

20 K.B., Horn. G.T., Erlich, HA, and Arnheim. N. (1985) Science 230, 1350-1354]. Another explanation is that more than 
one virus genome is present in the plasma of a single healthy carrier and that these genomes are heterogeneous at the 
nucleotide level. 

[0138] To clarify this point it was determined how many of these nucleotide differences would lead to amino acid 
changes, using the sequence from the NS3 domain of the J1 isolate as an example. Out of the five nucleotide differ- 

25 ences, three fall on the third position of the amino acid codon and do not change the amino acid sequence. Both of the 
remaining two nucleotide changes tall on the first position of the amino acid codon and generate amino acid changes 
of threonine to alanine and proline to alanine, all of which are small, neutral amino acid residues. Similarly, when ana- 
lyzing the nucleotide differences in other domains, many silent and conserved mutations are found. These results sug- 
gest that nucleotide sequences of the HCV genomes in the plasma of a single healthy donor are heterogeneous at the 

30 nucleotide level. 

[01 39] In addition, once the consensus sequences for each of the fragments were compiled each sequence was com- 
pared to the HCV1 isolate in Figures 6 through 1 0. In Figure 6 the fragment from the C/E region of the J7 isolate shews 
a 92.8%, 512/552, nucleotide and 97.4%, 150/154, amino acid homology to the HCV1 isolate. The fragment from the E 
domain of J1 shows a slightly lower nucleotide and amino acid homology to HCV1 in Figure 7 of 76.2% and 82.9%, 

35 respectively. The fragment from the J1 isolate which overlaps the envelope and non-structural one domains shows the 
lowest homology to HCV1, as seen in Figure 8, where the J1 isolate has a 71.5% nucleotide homology and a 73.5% 
amino acid homology to HCV1 . Figure 9 shows a comparison of the fragment from the NS3 domain of J 1 to HCV1 . The 
homology between the nucleotides sequences is 79.8%, while the amino acid homology between the isolates is quite 
high, 92.2% or 179/194 amino acids. Figure 10 shows the homology between the NS5 sequences from J1 and HCV1 . 

40 The sequences have a 84.3% nucleotide and 88.7% amino acid homology. 

[0140] The vectors described in the examples above were deposited with the Patent Microorganism Depository, Fer- 
mentation Institute, Agency of Industrial Science and Technology at 1 -3, Higashi 1 -chome Tsukuba-chi. Ibaragiken 305, 
Japan, and will be maintained under the provisions of the Budapest Treaty. The accession numbers and dates of the 
deposit are listed below, on page 68. 

45 

ill 

[0141] An HCV/J1 clone, J1-1519, was isolated using the essentially the techniques described above. However, the 
primers used in the isolation were J159S and 199 A. The sequences of the digomeric primers J159S and 199 A, which 
so follow, Were based upon those in J 1 -121 6 and in HCV1 . 

J159S 5' ACT GCC CTG AAC TGC AAT GA 3' 
199 A 5' AAT CCA GTT GAG TTC ATC CA 3' 

55 [0142] Clone JM 51 9 is comprised of an HCV cDNA sequence of 367 nucleotides which spans most of the 5'-haJf of 
the NS1 region and which overlaps the E -region done, J1-1216, by 31 nudeotides. Three independent 10 clones span- 
ning this region were sequenced; the sequences in this region obtained from the three dones were identical. The 
sequence of the HCV cDNA in J1 -121 6 (shown in the figure as J1) and the amino acids encoded therein (shown above 
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the nucleotide sequence) are shown in Figure 13. Figure 13 also shows the sequence differences between J1-1216 in 
the comparable region of the prototype HCV1 cDNA (indicated in the figure as PT), and the resulting changes in the 
encoded amino acids. The homology between the J 1-1 21 6 and HCV1 cDNA is approximately 70% at the nucleotide 
Iwel, and about 75% at the amino acid level. 

s [0143] A composite of the sequences from the putative core to NS1 region of the J1 isolate is shown in Figure 14; 
also shown in the figure are the amino acids encoded in the J1 sequence. The variation from the HCV1 prototype 
sequence is shown in the line below the J1 nucleotide sequence; the dashed lines indicate homologous sequences. 
The nonhomologous amino acid encoded in the HCV1 prototype sequence is shown below the HCV1 nucleotide 
sequence. 

10 [0144] Cloned material containing the J1/1519 HCV cDNA (psM519) has been maintained in DH5a and deposited 
with the Patent Microorganism Depository. 

jy 

15 [0145] Several regions of the J1 isolate, including the C200-C100 region from the putative NS3-NS4 region (which 
encompasses the region encoding the 5-1-1 polypeptide in HSV1 (See EPO Pub. No. 318,216), and the putative NS1 
- E region, were amplified using the PCR method. The C200-C1 00 region includes nucleotides 3799 to 5321 of the pro- 
totype HCV1 . RNA was extracted as described above, except that extraction was with guaridintum thiocyanate in the 
presence of Proteinase K and sodium dodecylsutfate (SDS) (Maniatis (1982), supra). The RNA was transcrtoed into 

20 HCV cDNA by incubation in a 25 mJ reaction comprised of 1 uM of each primer, 40 units of RNase inhfottor (RNASIN), 
5 units of AMV reverse transcriptase, and salts and buffer necessary for the reaction. Amplification of a segment of the 
HCV cDNA from the designated region was performed utilizing pairs of synthetic oligomer 16-mer primers. PCR ampli- 
fication was accomplished in three rounds (PCR I, PCR II, and PCRIII). The second and third rounds of PCR amplifica- 
tion (PCR II) utilized different sets of PCR primers; the first PCR reaction was diluted 10-fold and multiple rounds of 

25 PCR amplification were carried out with the new primers, so that ultimately up to 50% of the products of the first PCR 
reaction (PCR I) were reampiified. The primers used for the amplification of the regions were the following. These prim- 
ers, with the exception of J1C200-3 which was derived from the J1 isolate sequence, were derived from the prototype 
HCV1 sequence. 

30 Primers for amplification of the "5-1 -r region from NS3-NS4. 

[0148] 

PCR I 

35 511/16A (sense, derived from nucleotides starting at number 1528 of HCV1) 
5* AAC AGO CTQ CQT QQT C 3' 
511/16B (anti-sense, derived from nucleotides ending at 5260 of HCV1) 

40 

5* AQT TGG TCT GQA CAQ C 3' 
PCR II 

511A5A (sense, the HCV portion derived from nucleotides starting at number 5057 of HCV1, the restriction 
45 enzyme site is underlined) 

5' CTTGAATTC TCQ TCT TQT CCQ QQA AQC COO C AA TC 3* 

511/35B (anti-sense, the HCV portion derived from nucleotides ending at number 5233 of HCV1 ; the restriction 
so enzyme site is underlined) 

. 5* CTTGAATTC CCT CTQ CCT GAC GGQ ACQ CGQ TCT GC 3* 

PCRIII 

55 511/35A (see supra) 

VSNrc7 (antisense, derived from nucleotides ending at number 5804 of HCV1) 

5' GTA GTQ CQT GGG GGA AAC AT 3' 
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Primers for amplification of the "NS1/E* region 
[0147] 
5 PGR I 

(sense, the HCV portion derived from nucleotides starting at number 953 of HCV1 . the restriction enzyme 
site is underlined) 

5 1 CTTAGAATTC TGQ CAT QGQ ATA TGA TGA TQ 3' 

10 

J1(E)4 (sense, the HCV portion derived from nucleotides starting at number 1 087 of HCV1 , the restriction enzyme 
site is underlined) 

5* CHAQMEIC TCC ATQ GTQ GQG AAC TGQ GC 3' 

15 

(anti-sense, the HCV portion derived from nucleotides ending at 1 995 of HCV1 , the restriction enzyme site 
is underlined) 

5 1 CTTGAATTC TAA CGG GCT QAQ CTC GQA 3' 

20 

Jlrcl3 (anti-sense, the HCV portion derived from nucleotides ending at 1941 of HCV1 , the restriction enzyme site 
is underlined) 

5' CTTAGAATTC CGT CCA GTT GCA GGC AGC TTC 3' 

25 

PCR II 

J1rc13(see supra) 

J1IZ-1 (sense, the HCV portion is derived from nucleotides starting at number 1641 of HCV1, the restriction 
enzyme site is underlined) 

30 

5' CTTGAATTC CAA CTG GTTCGGCTG TAC A 3' 

(sense, the HCV portion is derived from nucleotides starting at number 1596 of HCV1, the restriction 
enzyme site is underlined) 

35 

5' TGA GAC GGA CGT GCT GCT CCT 3' 
Primers for the C200-C1OO reoion of the "NS3-NS4" region 
40 [0148] 

PCR I 

J1C200-1 (sense, derived from nucleotides starting at number 3478 of HCV1) 
45 5' TCC TAC TTG AAA GGC TC 3' 

J1C200-3 (anti-sense, derived from nucleotides ending at number 4402 of HCV1) 
5' GGA TCC AAG CTG AAA TCG AC 3' 

50 

J1rc52 (anti-sense, the HCV portion derived from nucleotides ending at 5853 of HCV1 , the restriction enzyme site 
is underlined) 

5' CTTAGAATTC GAG GCT GCT GAG ATA GGC AGT 3' 

55 

511/16A (see above). 
PCR II 

J1C200-2 (sense, the HCV portion derived from nucleotides starting at number 3557 of HCV1, the restriction 
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enzyme site is underlined) 

5' CTTGAATTC CCC GTG GAG TGG CTA AGG CGG TGG ACT 3' 

5 J1C200-4 (anti-sense, the HCV portion derived from nucleotides ending at 4346 of HCV1 , the restriction enzyme 
site is underlined) 

5' CTTGAATTC TCG AAG TCG CCG GTA TAG CCG GTC ATG 3' 

io 511/35A (see above) 

J1rc51 (anti-sense, the HCV portion derived from nucleotides ending at 5826 of HCV1 , the restriction enzyme site 
ts underlined) 

5' CUftGASIIQ GGC AGC TGC ATC GCT CTC CGG C AC 3' 

15 

The amplified HCV cDNAs were either sequenced directly without cloning, and/a were cloned. Sequencing 
was accomplished using an assymetric PGR technique, essentially as descrfoed in Shyamala and Ames, J. Bacte- 
riology 121:1 602 (1 989). In this technique, amplification of the cDNA is carried out with a limiting concentration of 
20 one of the primers (usually in a ratio of about 1 :50) in order to get preferential amplification of one strand. The pref- 
erentially amplified strand is then sequenced by the dideoxy chain termination method. 

The primers used for assymetric sequencing by the PCR method were the following. For the NS1 region: J1IZ- 
1 and Jlrcl3 (sequenced with both); J1IZ-2, Jlrc13 (confirmed on both strands). For the NS3-NS4 region, which 
includes the C200-C100 N-terminal region, C200-C100 (^terminal region, and the 5-1-1 region: J1C200-2 and - 
2S J1C200-7 (for the N-terminal region of C200-C100), and J1C20XM and J1C200-6 (for the C200-C100 C-terminal 
region); and 51 1/35A and hep 4 (for the 5-1-1 region). The sequences for J1C200-2, J1C200-4, and 51 1/35A are 
shown supra; the sequences of hep 4, J1 C200-6, and J1 C200-7 are the following. 
hep 4 (derived from nucleotides starting at number 5415 of HCV1) 

30 5' TT GGC TAG TGG TTAGTG GGC TGG TGA CAG 3' 

J1C200-6 (the HCV portion derived from nucleotides starting at number 3875 of HCV1 , the restriction enzyme site 
is underlined) 

35 5' CTTGAATTC CGT ACT CCA CCT ACQ GCA AGT TCC TT 3' 

J1C200'7 (the HCV portion derived from nucleotides starting at number 3946 of HCV1 , the restriction enzyme site 
is underlined) 

40 5' CTTGAATTC GTG GCA TCC GTG GAG TGG CAC TCG TC 3' 

[0149] The sequences obtained by assymetric sequencing of the "NS1 " region, the C200-C1 00 region, and the 5-1 - 
1 region are shown in Figure 15, and Figure 16, respectively. In the figures, the amino acids encoded in the J1 
sequence are shown above the J1 nucleotide sequence. The differences between the J1 sequence and the HCV1 pro- 
45 totype nucleotide sequence is shown below the J1 sequence (the dashes indicate homologous nucleotides in both 
sequences). The encoded amino acids which differ in the HCV1 prototype sequence are shown below the HCV1 nucle- 
otide sequence. 

[0150] HCV cDNAs from the NS1 region, the C200-C100 region, and the 5-1-1 region were cloned. A 300 bp and a 
230 bp fragment from the putative NS1 region, were cloned into a derivative of the commercially available vector, 

so pGEM-3Z, in host HB101, and deposited with the ATCC as AW-300bp. The derivative vectors maintain the original 
PGEM-3Z polylinkers, an intact Amp r gene, and the genes required for replication in £. £gli. The HCV cDNA fragments 
may be removed with Sad and Xbal. HCV cONAs containing 770 bp N-terminal fragments of C200 were cloned into 
pM1 E in HB1 01.12 clones were pooled and deposited with the ATCC as AW-770bp-N; the HCV cDN A may be removed 
from the vector with Haell. The resultant Haell fragment will contain vector DNA of 300 bp and 250 bp at the 5' and 3' 

ss ends, respectively. HCV cONAs containing 700 bp C-terminal fragments of C200 (AW-700bp-C) were cloned into 
M13mp10 and maintained in host DH5a-F'; cloning was into the vector polylinker srta The resultant phage were 
pooled, and deposited with the ATCC on September 11, 1990 as AW-700bp-N or AW-700bp-C. HCV cDNA from J1 
equivalent to the 5-1-1 region of HCV1 was cloned into mp19EcoR1 site, and maintained in DH5a-F. Several m13 
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phage superanants from this cloning were pooled and deposited with the ATCC as J1 5-1 *1. on September 11,1 990. 
The HCV cDNAs may be obtained from the phage by treatment with EcoRI. Accession numbers for J1 5-1-1 and AW- 
700bp-N or AW-700bp-C may be obtained by telephoning the ATCC at (301) 881-2600. 
[01 51 ] The above^descrfoed cloned material was deposited with the American Type Culture Collection (ATCC). 

5 

Y 

[0152] An HCV cDNA library containing sequences of the putative "NS1" region of the J 1 isolate was created by direc- 
tional cloning in X-gt22. The "NS1" region extends from about nucleotide 1460 to about nucleotide 2730 using the num- 

io bering system of the HCV1 prototype nucleic acid sequence, where nucleotide 1 is the first nucleotide of the initiating 
methionine codon for the putative polyprotein. The cloning was accomplished using essentially the method described 
by Han and Rutter in GENETIC ENGINEERING, Vol 10 (J.K. Setlow, Ed, Plenum Publishing Co., 1988), except that 
the primers for the synthesis of the first and second strand of HCV cDNA were JHC67 and JHC68, respectively, and the 
source of RNA was the J 1 plasma. In this method the RNA rs extracted with guanidum thiocyanate at a low tempera- 

15 ture. The RNA is then converted to fufl length cDNA, which is cloned in a defined orientation relative to the lafiZ pro- 
moter in X-phage. Using this method, the HCV cONAs to J1 RNA were inserted into the Notl site of X-gt22. The 
presence of "NS1 " sequences in the Ibrary was detected using as probe, AbcS4. 

[0153] The sequence of a region of "NS1" downstream from the region shown in Figure 14, but which overlaps the 
region by about 20 nucleotides, was determined using the assymetric sequencing technique described above, but sub- 
20 stituting as primers fa PCR amplification, Alx 61 and Aix 62. The resulting sequence is shown in Figure 1 7. (It should 
be noted that the PCR amplification was of a region from about nucleotide 1930 to about nucleotide 2340; this region 
is also encompassed in the sequence shown in Figure 15). The sequences of the primers and probes used to obtain 
the HCV cDNA library in X-gt22, and to sequence the portion of the "NS1 " region were the following. 

26 JHC 67 

5' GACGC GGCCG CCTCC GTGTC CAGCG COT 3' 

JHC 63 

30 

5' CGTGC GGCCG CAAGA CTGCT AGCCG AGGT 3' 
ALX 61 

as 5' ACCTO CCACT GTGTA GTGGT CAGCA GTAAC 3' 

ALX 62 

5' ACGGA CGTCT TCGTC CTTAACAATA CCAGG 3' 

40 

ALX 54 

5' GAACT TTGCG ATCTG GAAGACAGGG ACAGG 3' 

4$ [0154] A 400 bp fragment of J1 HCV cDNA derived from the sequenced region was cloned into pGEM3z and main- 
tained in HB101 ; the HCV cDNA may be removed from the vector with Sacl and Xbal. Host cells transformed with the 
vector (JH-400bp) have been deposited with the ATCC. 

[0155] A pooled cDNA Iforary was created from the J1 serum; the pooled library spans the J 1 genome and is identified 
as HCV-J1 X gt22. The pooled cDNA library was created by pooling aliquots of 1 1 individual cDNA libraries, which had 
so been prepared using the directional cloning technique described above, except that the Itoraries were created from 
primers which were designed to yield HCV cONAs which spanned the genome. The primers were derived from the 
sequence of HCV1 , and included JHC 67 and JHC 68. The HCV cONAs were inserted into the Notl site of X-gt22. The 
pooled cDNA Ibrary, HCV-J1 X gt22, has been deposited with the ATCC. 

55 VI 

[0156] The sequence of a region of the polynucleotide upstream of that shown in Figure 14 was determined. This 
region begins at nucleotide -267 with respect to the HCV-1 (See Figure 12) and extends for 560 nucleotides. Sequenc- 
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ing was accomplished by preparing HCV cDNA from RNA extracted from J1 serum, and amplifying the HCV cDNA 
using the PCR method. 

[0157] RNA was extracted from 100 jd of serum following treatment with proteinase K and sodium dodecylsulfate 
(SDS). The samples were extracted with phenol-chloroform, and the RNA precipitated with ethanol. 
[0158] HCV cDNA from the J1 isolate was prepared by denaturing the precipitated RNA with 0.01 M MeHgOH; after 
ten minutes at room temperature, 2-mercaptoethanol was added to sequester the mercury ions. Immediately, the mix 
for the first strand of cONA synthesis was added, and incubation was continued for 1 hr at 37°C. The conditions for the 
synthesis of the anti-sense strand were the following: 50 mM Tris HO, pH 83, 75 mM KCI, 3 mM MgO z . 10 mM drthio- 
thrertol, SOOuM each deoxynudeotide triphosphate, 250 pmol specific antisense cDNA primer r25, 250 units MMLV 
reverse transcriptasa In order to synthesize the second strand (sense), the synthesis reaction components were 
added, and incubated for one hour at 1 4°C. The components for the second strand reaction were as follows: 14 mM Tris 
HO, pH 8.3. 68 mM KO, 7.5 mM ammonium sulfate, 3.5 mM MgCfe, 2.8 mM ditNothreriol. 25 units DNA polymerase I. 
and one unit RNase H. The reactions were terminated by heating the samples to 95°C for 1 0 minutes, followed by cool- 
ing on foa 

[0159] The HCV cDNA was amplified by two rounds of PCR. The first round was accomplished using 20 ul of the 
cDNA mix. The conditions for the PCR reaction were as follows: 10 mM Tris HO. pH 8.3. 50 mM KCI. 1.5 mM MgCfe. 
0.002% gelating, 200 mM each of the deoocynucleotide triphosphates, and 2.5 units Amplrtaq. The PCR thermal cycle 
was as follows: 94°C one minute, 50°C one minute, 72°C one minute, repeated 40 times followed by seven minutes at 
72°C. The second round of PCR was accomplished using nested primers (La primers which bound to an internal region 
of the first round of PCR amplified product) to increase the specificity of the PCR products. One percent of the first PCR 
reaction was amplified essentially as the first round, except that the primers were substituted, and the second step in 
the PCR reaction was at 60°C instead of 50°C. The primers used for the first round of PCR were ALX90 and r14. The 
primers used for the second round of PCR were M4 and p14. 

[01 6Q] The sequences of the primers for the synthesis of HCV cDNA and for the PCR method were the following. 
r25 

5' ACC TTA CCC AAA TTQ CQC QAC CTA 3' 
ALX90 

5' CCA TQA ATC ACT CCC CTQ TGA GGA ACT A 3' 

M4 

5' GGG CCC CCAG CTA QGC CQA OA 3' 

p14 

5' AAC TAG TOT CTT CAC OCA QAA AQC 3' 

[0161] The PCR products were gel purified, the material which migrated as having about 615 bp was isolated, and 
sequenced by a modification of the Sanger dideoxy chain termination method, using ^P-ATP as label. In the modrfied 
method, the sequence replication was primed using P32 and R31 as primers; the double stranded DNA was melted for 
3 minutes at 95°C prior to replication, and the synthesis of labeled dideoxy terminated polynucleotides was catalyzed 
by Bst polymerase (obtained from BioRad Corp.). according to the manufacturer's directions The sequencing was per- 
formed using 500ng to 1 ug of PCR product per sequencing reaction. 

[0162] The primers P32 (sense) and R31 (antisense) were derived from nucleotides -137 to -1 15 and from nucle- 
otides 192 to 173, respectively, of the HCV1 sequence. The sequences of the primers are the following. 

P32 primer 

5' AAC CCQ CTC AATGCC TGG AGA TT3' 
R31 primer 

5' GGC COX CQA GCC TTQ QQG AT 3' 
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where X ■ A or Q 

[0163] The sequence of the region in the J1 isolate which encompasses the 5'-untranslated region as well as a part 
of the region of the putative "Core" is shown in Figure 18. In the figure, amino acids encoded in the J 1 sequence are 
shown above the nucleotide sequence. The sequence of the prototype HCV1 is shown below the J1 sequence; the 
dashes indicate sequence homology with J1. The differing amino acids encoded in the HCV1 sequence are shown 
below the HCV1 sequence. 

[0164] An HCV cDNA fragment which is a representative of the 600 bp J1 sequence descrfoed above (TC 600bp) 
was cloned into pGEM3Z and maintained in host HB1 01 ; the HCV cONA fragment may be removed with Sad and Xbal. 
This material is on deposit with the ATCC. 

[0165] Patent Microorganism Depository-deposited under Budapest Treaty terms. 



Deposited Materials 


Accession Number 


Deoosit Date 








E. cdi DH5fcS1-8791a 


BP-2593 


9/15/1989 


(This done contains 427 bp of the HS5 domain of J 1) 


E_CriiHBl01/pUM216c 


BP-2594 


9/15/1989 


(This clone contains 351 bp of the E/NS1 domains of J1) 


E, Wfi HB101/pU1-4652d 


BP-2595 


9/15/1989 


(This clone contains 


583 bp of the NS3 domain of J1) 


E.coii DH5o4>S1-713c 


BP-2637 


11/1/1989 


(This clone contains 580 bp of the E domain of J 1) 


E, rfj DH5a/|pS7-28c 


BP-2638 


11/1/1989 


(This clone contains 


552 bp of the C/E domain of J7) 


E^DH5a/ps1-1519 


BP3081 


8/30/90 



[0166] The following vectors descrfced in the Examples were deposited with the American Type Culture Collection 
(ATCC). 12301 ParWawn Dr. Rockville, Maryland 20852, and have been assigned the following Accession Numbers. 
The deposits were made under the terms of the Budapest Treaty. 



Deposited Materials 


Accession Number 


Deoosit Date 








TC-600BP (in 5, cofj HB101/t>GEM3Z) 


68393 


9/11/90 


JKMOObp (in E, cofj HB101/pGEM3Z) 


68394 


9/11/90 


AW-300bp (in E. coli HB101/pGEM3Z) 


68392 


9/11/90 


AW-770bp-N (in E. coli HB101/pM1E) 


68395 


9/11/90 


AW-700bp-C or AW-700bp-N (In LcqH DH5a-F/M13mp10) 






J1 5-1-1 fin E. coli DH5a-F/M13mp10) 






HCV-J1 X $22 


40884 


9/6/90 



[0167] These deposits are provided for the convenience of those stilled in the art These deposits are neither an 
admission that such deposits are required to practice the present invention nor that equivalent embodiments are not 
within the skill of the art in view of the present disclosure. The public availability of these deposits is not a grant of a 
license to make, use or sell the deposited materials under this or any other patent The nucleic acid sequences of the 
deposited materials are incorporated in to present disclosure by reference, and are controlling if in conflict with any 
sequences described herein. 

[01 68] While the present invention has been described by way specific examples for the benefit of those in the field, 
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the scope of the invention is not limited as additional embodiments will be apparent to those of skill in the art from the 
present disclosure. 

Claims 

5 

Claims for the following Contracting States : AT, BE, CH, U, DE, DK, FR, GB, GR, IT, LU, NL, SE 

1 . A polynucleotide in substantially isolated form comprising a nucleotide sequence of at least 1 5 nucleotides from a 
J-1 HCV isolate, said J-1 HCV isolate having at least 90% nucleotide sequence homology with the J-1 sequence 

io of any one Of Figures 7 to 10 or 13 to 18, wherein said nucleotide sequence of at least 15 nucleotides is distinct 
from the nucleotide sequence of HCV isolate HCV-1 

2. A polynucleotide according to daim 1 wherein the J-1 HCV isolate has at least 95% homology with the J-1 HCV 
sequence of any one of Figures 7 to 10 or 13 to 18. 

15 

3. A porynucieotide according to claim 1 wherein the J-1 HCV isolate has 100% homology with the J-1 HCV sequence 
of any one of Figures 7 to 10 or 13 to 18. 

4. A polynucleotide according to any one of the preceding claims which comprises at least 20 nucleotides. 

20 

5. A method of detecting HCV polynucleotides in a test sample comprising: 

(a) providing a polynucleotide as defined in any one of claims 1 to 4 as a probe; 

(b) contacting the test sample and the probe under conditions that allow for the formation of a polynucleotide - 
2$ duplex between the probe and its complement in the absence of substantial polynucleotide duplex formation 

between the probe and non-HCV polynucleotide sequences present in test sample; and 

(c) detecting any polynucleotide duplexes comprising the probe. 

6. A porynucieotide comprising a sequence of at least 15 nucleotides from a J-1 HCV isolate present in any one of 
30 plasmids pS1-8791a. bU1-1216c bU1-4652d. pS1-713c pSM519, TC-600BP, JH-400BP, AW-300BP, AW-770- 

BP-N, or J1 5-1-1 deposited under accession numbers BP-2593, BP-2594, BP-2595, BP-2637, BP-3081, 68393, 
68394, 68392, 68395 and 40884 respectively, wherein said nucleotide sequence of at least 15 nucleotides is dis- 
tinct from the nucleotide sequence of HCV isolate HCV-1 . 

35 7. A purified polypeptide comprising an amino acid sequence which: 

(a) is encoded by a nucleotide sequence as defined in any one of claims 1 to 4 or in the HCV sequences 
deposited and defined in daim 6, said coding being in frame with the corresponding amino acid sequences set 
out in Figures 7 to 10 or 13 to 18; 
40 (b) comprises an antigenic determinant; and 

(c) is distinct from the sequence of the polypeptides encoded by the HCV isolate HCV-1 

8. A polypeptide according to daim 7 which comprises at least 10 amino adds. 

45 9. A polypeptide according to daim 7 which comprises at least 15 amino acids. 

10. A polypeptide according to any one of daims 7 to 9 in»rx)bilised on a solid support 

1 1 . An immunoassay for detecting the presence of anti-HCV antibodies in a test sample which comprises: 

so 

(a) incubating the test sample under conditions that allow the formation of an antigen-antibody complex to be 
formed with a polypeptide as defined in any one of claims 7 to 10, wherein the polypeptide is not immunologi- 
cally cross-reactive with HCV-1 ; and 

(b) detecting any antigen antibody compiexes formed. 

55 

12. An immunoassay according to claim 1 1 wherein the test sample comprises human blood or a fraction thereof. 



25 



EP 0419 182 B1 



Claims for the following Contracting State : ES 

1 . A method of preparing a polynucleotide in substantially isolated form which comprises a nucleotide sequence of at 
least 1 5 nucleotides from a J-1 HCV isolate, said J-1 HCV isolate having at least 90% nucleotide sequence homol- 
ogy with the J-1 sequence of any one of Figures 7 to 10 or 13 to 18. wherein said nucleotide sequence of at least 
1 5 nucleotides is distinct from the nucleotide sequence of HCV isolate HCV-1 , said method comprising: 

a) chemical synthesis by methods known perse; or 

b) DNA replication by methods known perse; or 

c) transcription or reverse transcription by methods known perse; or 

d) restriction enzyme digestion and ligation into a vector by recombinant DNA techniques known per se. 

2. A method of preparing a polypeptide comprising an amino acid sequence which is encoded by a nucleotide 
sequence as defined in claim 1 , said method comprising: 

(i) chemical synthesis by methods known perse; or 

00 translation from the corresponding nucleic acid sequence by methods Known perse; or 

Oii) expression of a recombinant expression system comprising the nucleotide sequence encoding the 

polypeptide by methods taown per se; or 

Ov) isolation from virus by methods known perse. 

3. A method of detecting HCV polynucleotides in a test sample comprising: 

(a) providing a polynucleotide as defined in claim 1 as a probe; 

(b) contacting the test sample and the probe under conditions that allow for the for mati on of a polynucleotide 
duplex between the probe and its complement in the absence of substantial polynucleotide duplex formation 
between the probe and non-HCV polynucleotide sequences present in test sample; and 

(c) detecting any polynucleotide duplexes comprising the probe 

4. An immunoassay for detecting the presence of anti-HCV antibodies in a test sample which comprises: 

(a) incubating the test sample under conditions that allow the formation of an antigen-antibody complex to be 
formed with a polypeptide which is encoded by a nucleotide sequence as defined in claim 1 or a sequence of 
at least 15 nucleotides from a J-1 HCV isolate present in any one of plasmids pS1-8791a, bU1-1216c bU1- 
4652d. pS1-713c pSl-1519, TC-600BP. JH-400BP, AW-300BP, AW-770-BP-N or J1 5-1-1 deposited under 
accession numbers BP-2593, BP-2594, BP-2595, BP-2637, BP-3081, 68393, 68394, 68392, 68395 and 
40884 respectively, wherein said nucleotide sequence of at least 1 5 nucleotides is distinct from the nucleotide 
sequence of HCV isolate HCV-1 , said coding being in frame with the corresponding amino acid sequences set 
out in Figures 7 to 10 or 1 3 to 18, which polypeptide comprises an antigenic determinant and is distinct from 
the sequence of the polypeptide encoded by the HCV isolate HCV-1 and is not irrtrnunotogicaliy cross-reactive 
with HCV-1; and 

(b) detecting any antigen antibody complexes formed. 

5. An immunoassay according to claim 6 wherein the test sample comprises human blood or a fraction thereof. 

6. A polynucleotide in substantially isolated form comprising a nucleotide sequence of at least 1 5 nucleotides from a 
J-1 HCV isolate, said J-1 HCV isolate having at least 90% nucleotide sequence homology with the J-1 sequence 
of any one of Figures 7 to 10 or 13 to 18. wherein said nucleotide sequence of at least 15 nucleotides is distinct 
from the nucleotide sequence of HCV isolate HCV-1 . 

7. A polynucleotide according to claim 6 wherein the J-1 HCV isolate has at least 95% homology with the J-1 HCV 
sequence of any one of Figures 7 to 10 or 13 to 18. 

8. A polynucleotide accortfing to claim 6 wherein the J-1 HCV isolate has 1 00% homology with the J-1 HCV sequence 
of any one of Figures 7 to 10 or 13 to 18. 

9. A polynucleotide according to any one of claims 6 to 8 which comprises at least 20 nucleotides. 
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10. A polynucleotide comprising a sequence of at least 15 nucleotides from a J-1 HC V isolate present in any one of 
plasmids pS1*8791a, bU1 -1216c. bU1-4652d, pSl-713c. pS1-1519, TC-600BR JH-400BP, AW-300BP, AW-770- 
BP-N or J1 5-1-1 deposited under accession numbers BP-2593. BP-2594, BP-2595, BP-2637, BP -3081, 68393. 
68394, 68392, 68395 and 40884 respectively, wherein said nucleotide sequence of at least 15 nucleotides is dis- 

5 tinct from the nucleotide sequence of HCV isolate HCV-1 . 

1 1 . A purified polypeptide comprising an amino acid sequence which: 

(a) is encoded by a nucleotide sequence as defined in any one of claims 6 to 9 or in the HCV sequences 
w deposited and defined in claim 10. said coding being in frame with the corresponding amino acid sequences 

set out in Figures 7 to 10or 13to 18; 

(b) comprises an antigenic determinant; and 

(c) is distinct from the sequence of the polypeptide encoded by the HCV isolate HCV-1 . 
i$ 12. A polypeptide according to claim 1 1 which comprises at least 10 amino acids. 

13. A polypeptide according to claim 1 1 which comprises at least 15 amino adds. 

14. A polypeptide according to any one of claims 1 1 to 13 immobilised on a solid support 

20 

PatentansprOche 

PatentansprOche fQr fdJgende Vertragsstaaten : AT, BE, CH, U, DE, DK, FR, GB, GR, IT, LU, NL, SE 

25 1. Porynudeotid in im wesentfichen isolierter Form, umfassend eine Nudeotidsequenz aus mindestens 15 Nudeoti- 
den aus einem J-1 HCV-lsdat wobei das J-1 HCV-tsolat eine mindestens 90%ige Nudeotio&eo^enzhomologie mit 
der J-1-Sequenz einer der Figuren 7 bis 10 oder 13 bis 18 besitzt, wobei sich die Nudeotidsequenz aus minde- 
stens 1 5 Nudeotiden von der Nudeotidsequenz des HCV-lsoiats HCV-1 , wie in Figur 12 gezeigt unterscheidet 

so 2. Pofynudeotid nach Anspruch 1 , wobei das J-1 HCV-lsoiat eine mindestens 95%ige Homologie mit der J-1 HCV- 
Sequenz einer der Figuren 7 bis 10 Oder 13 bis 18 besitzt 

3. Pofynudeotid nach Anspruch 1, wobei das J-1 HCV-lsdat eine 100%ige Homdogje mit der J-1 HCV-Sequenz 
einer der Figuren 7 bis 10 Oder 13 bis 18 besitzt 

35 

4. Porynudeotid nach einem der vorstehende Anspruche, das mindestens 20 Nudeotide umfaBt 

5. Verfahren zum Nachweis von HCV-Polynudeotiden in einer Testprobe, umfassend: 

40 (a) Bereitstellen eines Pofynudeotids, wie in einem der Anspruche 1 bis 4 deftniert als Sonde, 

(b) In-Kontakt-Bringen der Testprobe und der Sonde unter Bedingungen, die die Bildung von Polynudeotid- 
Doppdstrangen zwischen der Sonde und deren Kompiementarstrang in Abwesenhert einer wesenUichen 
Polynudeotid-Doppetetrangbildung zwischen der Sonde und Nicht-HCV-Polynudeotidsequenzen, cfie in der 

45 Testprobe vorhanden sind, erlauben; und 

(c) Nachweis von alien Mynudeotid-Doppelstrangen. die de Sonde umfassen. 

6. Potynudeotid, umfassend eine Sequenz aus mindestens 15 Nudeotiden aus einem J-1 HCV-lsolat das in einem 
so der Plasmide pS1-8791a, bU1 -1216c, bU1-4652d, pSl-713c, pSM519, TC-600BP, JH-400BP, AW-300BP, AW- 

770-BP-N Oder J 1 5-1-1 hinterlegt unter den HimeriegungsriurTimern BP-2593. BP-2594, BP-2595, BP-2637, BP- 
3081, 68393, 68394, 68392, 68395 bzw. 40884 vorhanden ist, wobei sich die Nudeotidsequenz aus mindestens 
15 Nudeotiden von der Nudeotidsequenz des HCV-lsolats HCV-1 unterscheidet 

55 7. Gereinigtes Polypeptide umfassend eine Aminosauresequenz, die 

(a) von einer Nudeotidsequenz. die in einem der Anspruche 1 bis 4 def iniert ist Oder von den hinterlegten und 
in Anspruch 6 definierten HCV-Sequenzen codiert wird. wobei die Codierung im Leseraster mit den entspre- 
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chenden Aminosauresequenzen, die in den Rguren 7 bis 10 oder 13 bis 18 dargestelK sind, ist; 
(b) eine antigens Determinants umfaBt; und 
5 (c) sich von der Sequenz der von dem HCV-lsolat HCV-1 codierten Polypeptide unterscheidet 

8. Potypeptid nach Anspruch 7, das mindestens 10 Aminosfluren umfaBt. 

9. Porypeptid nach Anspruch 7, das mindestens 15 Aminos&uren umfaBt. 

io 

10. Polypepbd nach einem der AnsprOche 7 bis 9, das auf einem festen Trflger immobilisiert ist 

1 1 . Immunoassay zum Nachwets der Anwesenheit von Anti-HCV-AntikOrpem in einer Testprobe, wobei man 

15 (a) die Testprobe unter Bedingungen inkubiert die die Bildung eines zu bildenden Antigen-AntikOrper-Kornple- 

xes rrit einem Pdypeptid. wie in einem der AnsprOche 7 bis 10 defWert, erlaubea wobei das Polypeptid mtt 
HCV-1 nicht irrtmunoiogisch tareuzreaktiv ist; und 

(b) alie gebildeten Arrtigen-AntikOiper-Kornplexe nachwetst 

20 

12. Immunoassay nach Anspruch 11, worin die Testprobe menschliches Blut oder eine Frakticn davon umfaBt 

PatentansprOche fflr folgenden Vertragsstaat : ES 

25 1 . Verfahren zur Hersteilung eines Polynudectids in im wesentlichen isoiierter Form, das eine Nudeotkteequenz aus 
mindestens 15 Nudeotiden aus einem J-1 HCV-lsoiat umfaBt wobei das J-1 HCV-lsolat eine mindestens 90%ige 
Nudeotkteequenzhornologie mit der J-1-Sequenz einer der Rguren 7 bis 10 oder 13 bis 18 besitzt wobei sich die 
Nucleotkteequenz aus mindestens 15 Nudeotiden von der Nudeotidsequertz des HCV-lsolats HCV-1 , unterschei- 
det. wobei das Verfahren umfaBt 

30 

a) chemische Synthase nach an sich bekannten Verfahren; oder 

b) DNA-Reptikabon nach an sich bekannten Verfahren; oder 

35 c) Transkription oder Umkehrtranskription nach an sich bekannten Verfahren; oder 

d) Restrikfonsenzymspaltung und Ligierung in einem Vektor durch an sich bekannte rekombinante DNA-Tech- 
niken. 

40 2. Verfahren zur Herstellung eines Poiypeptids, umfassend eine Aminosauresequenz, die von einer Nudeotidse- 
quertz, wie in Anspruch 1 definiert, codiert wird, wobei das Verfahren umfaBt 

(i) chemische Synthese nach an sich betartnten Verfahren; oder 

45 (ii) Translation von der entsprechenden Nudeins&uresequenz nach an sich bekannten Verfahren; oder 

Oil) Expression eines rekombinanten Expressionssystems, das die Nudeotkteequenz umfaBt, die das Porypep- 
tid codiert nach an sich bekannten Verfahren; oder 

so (jv) Isdation aus einem Virus nach an sich bekannten Verfiahren. 

3. Verfahren zum Nachweise von HCV-Polynudeotiden in einer Testprobe, umfassend: 

(a) Bereitstellen eines Polynudeotids, wie in Anspruch 1 definiert, als Sonde, 

55 

(b) In-Kbntakt-Bringen der Testprobe und der Sonde unter Bedingungen, (fie die Bildung eines Porynudeotid- 
Doppelstranges zwischen der Sonde und seinem Konplementdrstrang in Abwesenheit einer wesentlichen Bil- 
dung von Polynudeotid-Doppetetrangen zwischen der Sonde und den in der Testprobe vorhandenden Nicht- 
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HCV-Polynudeotidsequenzen ertauben; und 

(c) Nachweis aller Polynucleotid-Doppelstrange. die die Sonde umfassen. 
5 4. Immunoassay zum Nachweis der Anwesenheit von Anti-HCV-AntikOrpern in einer Testprobe. umfessend: 

(a) Inkubieren der Testprobe unter Bedingungen, die die Bildung eines zu bildenden Artigen-Antikflrper-Kbm- 
pJexes mit einem Polypeptid ertauben, das von einer Nudeotidsequenz, wie in Anspruch 1 def iniert, Oder einer 
Sequenz aus mindestens 15 Nudeotiden aus einem J-1 HCV-lsolat, das in einem der Plasmide pS1-8791a, 

10 WJ1-1216C bU1-4652d, pS1-713c pS1-1519. TC-600BP, JH-400BP, AW-300BP, AW-770-BP-N oder J1 5-1-1, 

die unter den Hinterlegungsnummem BP-2593, BP-2594, BP-2595, BP-2637, BP-3081, 68393, 68394, 68392, 
68395 bzw. 40884 hintertegt wurden, vomanden tst, codiert wind, wobei sich die NucJeotidsequenz aus minde- 
stens 15 Nudeotiden von der Nudeotidsequenz des HCV-teo-Jats HCV-1 unterscheidet wobei die Codierung 
im Leseraster mit den entsprechenden Aminosauresequenzen der Figuren 7 bis 10 Oder 13 bis 18 ist. wobei 

is das Polypeptid eine antigene Determinante umfaSt und sich von der Sequenz des von dem HCV-lsolat HCV- 

1 codierten Pdypeptids unterscheidet und mit HCV-1 nicht immunologisch kreuzreaktiv ist; und 

(b) Nachweis aller gebildeten Antigen-AntikOrper-Komplexa 

20 5. Immunoassay nach Anspruch 6, worin die Testprobe menschiiches Blut Oder eine Fraktion davon umfaBt 

6. Potynudeotid in im wesenttichen isdierter Form, umfassend eine Nudeotidsequenz von mindestens 15 Nudeoti- 
den aus einem J-1 HCV-lsolat wobei das J-1 HCV-lsolat mindestens 90% Nudeoddsequenz-Homologie mit der J- 
1 -Sequenz einer der Figuren 7 bis 10 Oder 13 bis 18 besitzt wobei sich die Nudeotidsequenz von mindestens 15 

25 Nudeotiden von der Nudeotidsequenz des HCV-lsolats HCV-1 unterscheidet 

7. Potynudeotid nach Anspruch 6, wobei das J-1 HCV-lsolat eine mindestens 95%ige Homoiogie mit der J-1 HCV- 
Sequenz einer der Figuren 7 bis 10 Oder 13 bis 18 besitzt. 

so 8. Potynudeotid nach Anspruch 6, wobei das J-1 HCV-lsdat eine 100%ige Homdogie mit der J-1 HCV-Sequenz 
einer der Rguren 7 bis 10 Oder 13 bis 18 besitzt 

9. Potynudeotid nach einem der Anspruche 6 bis 8, das mindestens 20 Nudeotide umfaBt. 

35 10. Potynudeotid, umfassend eine Sequenz aus mindestens 15 Nudeotiden aus einem J-1 HCV-lsolat das in einem 
der Plasmide pS1-8791a, bU1-1216c ( bU1-4652d, pS1-713c, pS1-1519, TC-600BP, JH-400BP, AW-300BP, AW- 
770-BP-N oder J1 5-1 -1 . hintertegt unter den Hinteriegungsnummern BP-2593, BP-2594. BP-2595. BP-2637, BP- 
3081. 68393, 68394, 68392. 68395 bzw. 40884, vomanden ist wobei sich die Nudeotidsequenz aus mindestens 
15 Nudeotiden von der Nudeotidsequenz des HCV-lsolats HCV-1 unterscheidet 

40 

11. Gereinigtes Polypeptid, umfassend eine Amtnosauresequenz, die 

(a) von einer Nudeotidsequenz, wie in einem der Anspruche 6 bis 9 def iniert oder den HCV-Sequenzen, die 
hinteriegt wurden und in Anspruch 10 definiert sind. codiert wird, wobei die Codierung im Leseraster mit den 

45 entsprechenden Aminosauresequenzen, die in den Figuren 7 bis 10 oder 13 bis 18 dargesteift sind. erfolgt; 

(b) eine antigene Determinarrte umfaBt; und 

(c) sich von der Sequenz des von dem HCV-lsolat HCV-1 codierten Polypeptids unterschetdet. 

50 

12. Polypeptid nach Anspruch 1 1, das mindestens 10 Aminosauren umfaBt. 

13. Polypeptid nach Anspruch 1 1, das mindestens 15 Aminosauren umfaBt 

55 14. Polypeptid nach einem der Anspruche 1 1 bis 13. das auf einem fasten Trager immobilisiert ist 
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Revendications 

Revendlcatlons pour les Etats contractants sulvants : AT, BE, CH, U, DE, DK, FR, GB, GR, IT, LU, NL, SE 

5 1. Polynucleotide sous une forme sensiHement isoiee. comprenant une sequence nudeotidique cfau moins 15 
nucleotides d'un isolat HCV J-1. ledit isolat HCV J-1 ayant une homologie de sequence nudeotidique d'au moins 
90 % avec la sequence J- 1 de Tune quelconque des figures 7 a 1 0 ou 1 3 a 1 8, dans lequel ladite sequence nudeo- 
tidique d'au moins 15 nucleotides est differente de la sequence nudeotidique de llsolat HCV-1 de HCV, telle que 
representee k la figure 12. 

10 

2. Polynucleotide selon la revendication 1 dans lequel (Isolat HCV J-1 a une homologie d'au moins 95 % avec la 
sequence de HCV J-1 de Tune quelconque des figures 7 a 10 ou 13 a 18. 

3. Polynucleotide selon la revendication 1 dans lequel r isolat HCV J-1 a une homologie de 100 % avec la sequence 
is de HCV J-1 de Tune quelconque des figures 7 a 10 ou 13 a 18. 

4. Polynucleotide selon rune quelconque des revendications precedentes qui comprend au moins 20 nucleotides. 

5. Methode de detection de polynucleotides de HCV dans un echantillon de test comprenant : 

20 

(a) la fourntture (fun polynucleotide tel que defini dans I'une quelconque des revendications 1 k 4 comme 
sonde ; 

(b) la mise en contact de rechantilkxi de test et de la sonde dans des conditions qui permanent la formation 
cfun duplex pdynud6ofctique entre la sonde et son complement en rabsence de formation appreciable de 

2S duplex pofynud6otidique entre la sonde et des sequences polynucieotidiques autres que de HCV presentes 

dans rechantillon de test ; et 

(c) la detection de tous duplex polynudeotidiques comprenant la sonde. 

6. Polynucleotide comprenant une sequence d'au moins 15 nucleotides d'un isolat HCV J-1 present dans I'un quel- 
30 conque des ptasmides pS1-8791a. bU1-1216c. bu1-4652d. pS1-713c. pS1-1519. TC-600BP. JH-400BP, AW- 

300BP, AW 770-BP-N ou J1 5-1 -1 deposes sous les numeros (faeces BP-2593. BP-2594, BP-2595, BP-2637, BP- 
3081 . 68393. 68394, 68392, 68395 et 40884 respectivement, ladite sequence nudeotidique d'au moins 1 5 nud6o- 
tides etant drff6rente de la sequence nudeotidique de llsolat HCV-1 de HCV. 

38 7. Polypeptide purifie comprenant une sequence d'acides amines qui : 

(a) est codee par une sequence nudeotidique telle que definie dans I'une quelconque de6 revendications 1 a 
4 ou dans les sequences de HCV d6posees et definies a la revendication 6, ladite sequence codante etant en 
phase avec les sequences d'acides amines correspondantes presentees dans les figures 7 a 1 0 ou 13 a 18 ; 
40 (b) comprend un determinant antgenique ; et 

(c) est cfistincte de la sequence des polypeptides codes par I Isolat HCV-1 de HCV. 

8. Polypeptide selon la revendication 7 qui comprend au moins 10 acides amines. 

45 9. Polypeptide selon la revendication 7 qui comprend au moins 15 acides amines. 

10. Polypeptide selon I'une quelconque des revendications 7 a 9 immobilise sur un support solide. 

1 1 . Test immunologique destine a detector la presence d'anticorps ant-HCV dans un echantillon de test qui comprend 

50 

(a) I'incubation de rechantillon de test dans des conditions qui permettent la formation d'un complexe antigene- 
anticorps a former avec un polypeptide tel que defini dans Tune quelconque des revendications 7 a 1 0, le pdy- 
peptide ne produisant pas de reaction immunologique crotsee avec HCV-1 ; et 
55 (b) la detection de tout complexe antigene-anticorps forme. 

12. Test immunologique selon la revendication 1 1 dans lequel rechantiUon de test comprend du sang humain ou une 
fraction de celui-d. 
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Revendlcatlons pour I'Etat contractant sulvant : ES 

1. Methode de preparation d'un polynucleotide sous une forme sensibtement isolee qui comprend une sequence 
nudeotidique cTau moins 15 nucleotides d'un isolat HCV J-1, (edit isoiat HCV J-1 ayant une homologie de 
s sequence nudeotidique d'au moins 90 % avec la sequence J-1 de Tune quelconque des figures 7 k 10 ou 13 k 1 8. 
ladite sequence nudeotidique d'au moins 15 nucleotides etant differente de la sequence nudeotidique de I'isdat 
HCV-1 de HCV, ladite methode comprenant :. 

a) la synthese chimique par des methodes connues perse ; on 
10 b) la replication d'ADN par des methodes connues perse ; ou 

c) la transcription ou retrotranscription par des methodes connues perse ; ou 

d) la digestion par des enzymes de restriction at la ligature dans un vecteur par des techniques de I'ADN 
recombinant connues per se. 

is 2. Methode de preparation (fun polypeptide comprenant une sequence d'addes amines qui est codes par une 
sequence nudeotidique telle que definie k la revendication 1 , ladite methode comprenant : 

(i) la synthese chimique par des methodes connues per se ; ou 

00 la traduction k partir de la sequence d'acide nudeique correspondante par des methodes connues per se 
20 ; ou 

(i") rexpression d'un systeme d'expression recombinant comprenant la sequence nudeotidique codant pour le 

polypeptide par des methodes connues per se ; ou 

fiv) ITsolement k partir de virus par des methodes connues per se. 

25 3. Methode de detection de polynudeotides de HCV dans un echantillon de test comprenant : 

(a) la fourniture dun pofynudeotide teJ que defini k la revencScations 1 comme sonde ; 

(b) la mise en contact de rechantillon de test et de la sonde dans des conditions qui permettent la formation 
d'un duplex polynudeotidique entre la sonde et son complement en I'absence de formation appreciable de 

50 duplex polynudeotidique entre la sonde et des sequences potynudeotidiques autres que de HCV preserves 

dans rechantillon de test ; et 

(c) la detection de tous dipiex potynudeotidiques comprenant la sonde. 

4. Test immunologique destine k detector la presence d'anticorps anti-HCV dans un echantillon de test qui comprend 

35 

(a) Hncubation de rechantillon de test dans des conditions qui permettent la formation d'un complexe antigene- 
anticorps k former avec un polypeptide qui est code par une sequence nucieotkfique telle que definie k la 
revendication 1 ou une sequence d'au moins 15 nudeotides d'un isolat HCV J-1 present dans run quelconque 

40 des plasmides pSl-8791a ( bUM216c bU1-4652d, pSl-713c. pS1-1519. TC-600BP, JH-400BR AW-300BP, 

AW 770-BP-N ou J1 5-1-1 deposes sous les numeros d'acces BP-2593, BP-2594, BP-2595, BP-2637, BP- 
3081, 68393, 68394, 68392. 68395 et 40884 respectrvement. ladite sequence nudeotidique d'au marts 15 
nudeotides etant differente de la sequence nudeotidique de ilsolat HCV-1 de HCV, ladite sequence codante 
etanten phase avec les sequences d'acides amines correspondantes representees dans les figures 7 k 10 ou 

45 13618, lequel polypeptide comprend un determinant antigenique et est distinct de la sequence du polypeptide 

code par Ilsolat HCV-1 de HCV et ne produh pas de reaction immunologique croisee avec HCV-1 ;et 

(b) la detection de tous complexes antigene-anticorps formes. 

5. Test immunologique selon la revendication 6 dans lequel rechantillon de test comprend du sang humain ou une 
so fraction de celui-d. 

6. Polynudeotide sous une forme sensWement isolee comprenant une sequence nudeoticSque d au moins 15 
nudeotides d'un isolat HCV J-1, 1 edit isolat HCV J-1 ayant une homologie de sequence nudeotidique d'au moins 
90 % avec la sequence J-1 de I'une quelconque des figures 7 k 10 ou 1 3 k 18, dans lequel ladite sequence nudeo- 

55 tidique d'au moins 15 nucleotides est differente de la sequence nudeotidique de Ilsolat HCV-1 de HCV. 

7. Potynudeotide selon la revendication 6 dans lequel ilsolat HCV J-1 a une homologie d'au moins 95 % avec la 
sequence de HCV J-1 de I'une quelconque des figures 7 k 10 ou 1 3 k 1 8. 
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8. Polynucleotide selon la revendication 6 dans lequel I'isolat HCV J-1 a une homologie de 100 % avec la sequence 
de HCV J-1 de Tune queiconque des figures 7 & 10 ou 13 & 18. 

9. Polynucleotide selon Tune queiconque des revendications 6 & 8 qui comprend au moins 20 nucleotides. 

10. Polynucleotide comprenant une sequence d'au moins 15 nucleotides d'un tsdat HCV J-1 present dans run quei- 
conque des plasmides pS1-8791a, bl)M216c. bU1-4652d, pS1-713c pS1-1519. TC-600BP, JH-400BP, AW- 
300BP, AW 770-BP-N ou J1 5-1 -1 deposes sous les numeros d'acces BP-2593. BP-2594, BP-2595, BP-2637, BP- 
3081 . 68393, 68394. 68392, 68395 et 40884 respectivement, ladite sequence nudeotidique d'au moins 1 5 nucleo- 
tides etant drfferente de la sequence nudeotidique de I'isolat HCV-1 de HCV. 

1 1 . Polypeptide purif ie comprenant une sequence d'acides amines qui : 

(a) est codee par une sequence nudeotidique telle que dtfinie dans I'une queiconque des revendications 6 k 
9 ou dans les sequences de HCV deposees et def inies * la revendication 1 0. ladite sequence codante etant 
en phase avec les sequences d'acides amines correspondantes presentees dans les figures 7 & 10 ou 13 A 18 

» 

(b) comprend un determinant antigenique ; et 

(c) est cfistincte de la sequence du polypeptide code par llsoiat HCV-1 de HCV. 

12. Polypeptide selon la revendication 11 qui comprend au moins 10 addes amines. 

13. Polypeptide seton la revendication 1 1 qui comprend au moins 15 acides amines. 

14. Polypeptide selon i'une queiconque des revendications 11 & 13 immobilise sur un support solida 
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J i 1 AGC c g agt agtgttgggtcgcgaaaggccttgtggt actgc ctg at aggctg CTTG CC AG 60 

discrepancy 

clone 

altered aa 

* • • » . . 

!1etSerThrAsnProLysProGlnAr$Lys 

JT 61 tgccccckk;agctctcgtagaccgtgcatcatga(x^c.kaatcctaaaccccaa.\gaaa.\ 120 

TG 
bl 
---Ar* 

* • • • • . 

ThrLysArcAsnThrAsnArcArcProGlnAspValLysPheProGlyGlyGlyGlnlle 

J 7 121 ACC AAACGTAAC ACC AACCGTCGCCCACAGGACGTTAAGTTCCCGGGCGGTGGTCAGATC L 80 

c r 

b 1 
— Leu 

* • • ■ * . 

ValGlyClyValTyrLeuLeuProArsAriGlyProArrLeuGlyValArjAlaThrArc 
JT 181 GTCGGTGGAGTTTACTTGTTGCCGCGCAGGGGCCCCAGGTTGGGTGTGCGTGCGACTAGG 240 
T A 
b b 



LysThrSerGluArgSerGlnProAriGlyArgArgGlnProIleProLysAlaArcAr* 
JT 241 .AAGACTTCCGAGCGGTCGCAACCTCGTGGAAGGCGACAACCTATCCCCAAGGCTCGCCGG 300 

A 
b 

* • > ■ • • 

ProGluGlyArfThrTrpAlaClnProGlyTyrProTrpProLeuTyrGlyAsnGluCly 
JT 301 CCCGAGCG<>GGACCTGGGCTCAGCCTGGGTATCCTTGGCCCCTCTATGGCAATGAGGG^ 360 



UttGlyTrpAlaGlyTrpLeuLeuSerProArcGlySerArfProSerTrpGlyProAsn 
JT 361 ntXKtnXXXKaGGATGGCTCC^ 420 
A T C 

b c b 

DID — Thr 

* • • • * * 

AspProArfArcArcSerArcAsnLeuGlyLysVallleAspThrLeuthrCysGiyPhe 
JT 421 GACCCCCGGCGTAGGTCGCGTAATTrGGCTAAGGTCATCGATACCCTTACATGCGGCTTC 480 

C 
1 

Leu 

FIGURE L-l 
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* « * • • • 

AlaAapLeuMetGlyTyrlleProLeuValGlyAlaProLeuGlyClyAlaAlaArffAla 
J7 481 GCCGACCTCATOXKTACAmCGCTrCT^ 540 

c c 

c b 



LeuAlaHisGly 
J7 54 1 CTGGCACATGGT 552 



FIGURE i-2 
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• * * • • * 

ProLeuValGlyAlaProLeuGlyGlyAlaAlaAr«AlaLeuAl*HisGlyVaUrgVal 
J 1 i TCCGCTCGTCGGCGCCCCCTTAC^KJGGCGCTGCCAGGGCCCTGGCACATGGTGTCCGGGTT 6 1 
discrepancy C 
clone d 
altered aa Ser 

****** 

LeuGluAspGlyValAsnTyrAlaThrGlyAsnLeuProGlyCysSerPheSerllePhe 
62 CTGGAGGACGGCGTGAACTATGCAACAGGGAATTTGCCCGGTTGCTCTTTCTCTATCTTC 121 



LeuLeuAlaLeuLeuSerCysLeuThrlleProAlaSerAlaTyrGluValAr^AanVal 
L22 CTCTTGGCTCTGCTGTCCTGTTTGACCATCCCAGCTTCCGCTTATGAAGTGCGCAACGTG 181 
A T 
9 d 



SerGlylleTyrHisValThrAsnAapCysSerAsnSerSerlleValTyrGluAlaAla 
182 TCCGGGATATACCATGTCACAAACGACTGCTCCAACTCAAGCATTGTGTATGAGGCGGCG 24 1 

T 

d 



AspVallleNetHlsAlaProGlyCysValProCysValAriGluAsnAsnSerSerArr 
242 GACGTGATCATGCATGCCCCCGGGTGCGTGCCCTGCGTTCGGGAGAACAATTCCTCCCGT 30 1 

C 
d 



CysTrpValAlaLeuThrProThrLeuAlaAlaArgAsnAlaSerValProThrThrThr 

302 TGCTGGGTAGCGCTCACTCCCACGCTCGCGGCCAGGAATGCCAGCGTCCCCACTACGACA 361 



LeuArcArcHlsValAapUuUuValGlyThrAlaAlaPneCysSerAlaMetTyrVal 

362 TTACGACGCCACCTCGACTTGCTCGTTGGGACGG Cl OC I'l' 1 CTGCTCCGCT ATGTACGTG 421 
G 
d 



FIGURE 2-1 
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GlyAspLeuCysGlySerValPheLeuIleSerClnLeuPheThrPheSerProAr^Arff 
4 22 GGGGATCTCTGCGGATCTGTTTTCCTCATCTCCCAGCTGTTCACCTTCTCGCCTCGCCCG 48 1 

T 
d 



HlsGluThrValGlnAspCysAsnCysSerlleTyrProGlyHlsValSerGlyHisArc 
482 CATGAGACAGTACAGGACTGCAACTGCTCAATCTATCCCGGCCACGTATCAGGCCATCGC 541 

T 

c 



MetAlaTrpAs pile tttetMetAsnTrpSerProThr Ala 
542 ATGGCTTGGGATATGATGATGAACTGGTCGCCCACGGCA 580 



FIGURE 2-2 
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AsnTrpSerProThrAla 

Jl l AACTGGTCGCCCACGGCA 18 

discrepancy : : 

done 

altered aa 

• * « » • ■ 

AlaLeuValValSerGlnLeuLeuArfflleProGlfiAlaValMetAspNerValAlaGly 

Jl 19 GCCTTAGTGGTGTCGCAGTTACTCCGGATCCCACAAGCTGTCATGGACATGGTGGCGGGG T8 



AlaHisTrpGlyValLeuAlaGlyLeuAlaTyrTyrSerfletValGlyAsnTrpAlaLys 

Jl 79 GCCCACTGGGGAGTCCTAGCCGGCCTTGCCTACTATTCCATCGTGGGGAACTGGGCTAAG 138 

G A 
L I 



ValLeuIleValMetLeuLeuPheAlaGlyValAspGlyHlsThrArgValThrGlyGly 
Jl 139 GTTTTGATTGTGATGCTACTCTTTGCCGGCGTTGACGGGCATACCCGCGTGACGGGGGGG 198 

AG A 
If 1 

Ser 

• • • • • • 
ValGlnGlyHlsValThrSerThrLeuThrSerLeuPheArcProGlyAiaSerGlnLys 

Jl 199 GTGCAAGGCCACGTCACCTCTACACTCACGTCCCTCTTTAGACCTGGGGCGTCCCAGAAA 258 

T G 
c i 

Ala 

• •*••• 
IleGlnLeuValAsnThrAsnGlyS^rTrpHisIleAsnArcThrAlaLeuAsnCysAsn 

Jl 259 ATTCAGCTTGTAMCACCAATGGCAGTTGGCATATCAACAGGACIT^ 318 

TC T T 
11 1 f 
SerLeu 

m 

AapterLtuGlnThrGlyPheLeuAlaAlaLeu 
Jl 319 GACTCCCTCCAAACTGGGTTCCTTGCCGCGCTG 351 



FIGURE 3 
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Jl 1 CTCA 4 

discrepancy 
dona 
altared aa 

VallleAspCysAsnnrCysValThrOlnThrVaiAspPheSarLauAspProThrPha 

Jl 5 GTGATCGACTGTAACACATGTGTCACTCAGACOT 64 

G 
c 

Ala 

ThrU^luThrThrllirVUProGinAapAlaVaisarArtnirGlnArsArcGiyAr^ 

Jl 65 ACCATCGAGACGACGACCGTGCCCCAAGATGCGGTTTCGCGCACGCAGCGGCGAGGTAGG 124 



ThrGlyAriGlyArcArfGlylltTyrArfPheValThrProGlyGluArcProSarAla 

Jl 125 ACTGGCAGGGGCAWAGAGGCATCTATAG<nTrCTWCTCCAG(UGAACGGCCCTCGGtt 164 



NetPheAspSerSarValLauCrsGluQrsTyrAspAlafilyQraAlaTrpmGluUa 
Jl 185 ATGTTCGATrCIT CW T CC T A TGTGAGTGTTATGACGCGGGCltirGCI TGGTATGAGCTC 244 

A 

Gly(-) 

ThrFi^aCluThrtoValArfUuAr^ 
Jl 245 ACGCCCGCTGAGACCTCGCTTAGGTTWGGGCTrACCTAAATACACCAGGGTTGtt 304 



CyiClnAapii lUuGluPbaTrpGluSarValPhaThrGlyUuT^iill^P^ • 

Jl 305 TGCCACCUCCATCraaGTTCTCCCAGAG 364 



• • * 

HUPhaUuSarGlnttrWlnAlaGlyAapAinFh^ 
Jl 365 CA U rCl 1 U I'Q CCAGACTAAOCAGGCAGGAGACAACrrCCCCTACCTGGTAGCATACCAA 424 



FIGURE 4-1 
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* * • • 

im *iJ^)I^^^«Al«lorjAlaPToProProSerTrpAspGlnitetTrpUysCys 

425 GCCACAGTGTGCGCCAGGCCTAAGGCrcCACCTCCATCGTGWATCAAAT^rTGGAACTCT 

C 
t 

Al*(.) 

... ^ 1 ^^^ tpp ^^i«61yPToThrProUuLwTyrArfUuoiyAl» 
485 CT^TACGCCrAAAGCCrACGCTCCACGtX^CWCCCCCCTGCTGTATAGGCTAGCAGCC 



Ala Ar»(«) 



ValGlnAsnGluV«lThrUuThrHliProZl«ThrI^t 

S4S GTCCAGAATGAGGTCACCCTCACACACCCTATAACCAAA 583 



FIGURE 4-2 

r 
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LeuThr 

J I 1 CCTCACC 7 

discrepancy 

clone 

altered aa 

ArgAapProThrValProLeuAlaArfAlaAlaTrpGluThrAlaArrtisThrProVai 
8 CGTGACCCCACCGTCCCCCTTGCCCGGCCTtXGTCGGACACAGCTACACACACTCCAGTC 67 

C 

Thr(-) 

AsnSerTrpLeuGlyAsnlielleMetTyrAlaProThrLeuTrpAlaArgMetlleLeu 

68 .AACTCCTGGCTAGGCAACATCATCATGTATGCGCCCACTTTGTGGGCAAGGATGATTCTG 127 

T 

t 

Ile<-> 

netThrHlsPhePheSerlleLeuLeuAlaGlnGluGlnLeuGluLysAiaLeuAspCys 
128 ATGACTCACTTCTTCTCCATCnTCTAGCCCAGGAGCAACTTGAAAAAGCCCTGGATTGT 187 



GlnlleTyrGlyAlaCysTyrSerlleGluProLeuAspLeuProGlnllelleGluAry 
188 CAAATCTACGGGGCCTGTTACTCCATTGAGCCACTTGACCTACCTCAGATCATTGAACGA 247 



LeuHlsGlyLeuSerAlaPheSerLeuHlsSerTyrSerProGlyGlulleAsnArfVal 

248 CTCCATGGTCTTAGCGCATTTTCACTCCATAGTTACTCTCCAGGTGAGATCAATAGGGTG 307 



AlaSerCysLeuArgLysLeuGlyValProProLeuArcValTrpArcHisArcAlaArf 
308 GCTTCATGCCTCAGGAAGCTTGGGGTACCACCCTTGCGAGTCTGGAGACATCGGGCCAGA 367 



SerValArcAlaUrsLeuLeuSerGlnGlyGlyArrAlaAlaThrLysGlyLysTyrLeu 
368 AGTGTCCGCGCTAAGCTACTGTCCCAAGGGGGGAGGGCCGCCACTTGTGGCM 427 

G 
ff 

Gln(-) 



FIGURE 5 
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* • « • • * 

J7 1 AGCCGAGTAGTCTTCGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAG 60 

HCV1 



M«tS«rThrAsnProLysProGlnAr*Lys 
J7 61 TGCCCCGGGAGGTCTCGTAGACCGTGCATCATGAGCACAAATCCTAAACCCCAAAGAAAA 120 

HCV1 C G T A 

Lys 

• ••*•* 
ThrLyaArcAsnThrAsoAriArfProGlnAspValLysPheProGlyGlyGLyGlnlle 

JT 121 ACCAAACGTAACACCAACCGTCGCCCACAGGACGTTAAGTTCCCGGGCGGTGGTCAGATC 180 

HCV1 A C T C 

Asa 

* 4 * * * * 

ValGlyGlyValTyrLeuLeuProArcArfGlyProArtLeuClyValArcAlaThrArc 
JT 131 GTCGGTGGAGTrTACTrGTTGCCGCGCAGGGGCCCCAGGTrofGTGTGCCTGCGACTAGG 240 

HCV1 T T A . C G A 



LysThrSerGluArsSerGlnProArcGlyArcArcGloProIleProLysAlaArcArc 
J7 241 AAGACTTCCGAGCGGTCGCAACCTCGTGGAAGGCGACAACCTATCCCCAAGGCTCGCCGG 300 
HCV1 A T A T 0 T 



ProGluGlyArcThrTrpAiaGlnProGlyTyrProTrpProLeuTyrGlyAanGluGly 
J7 301 CCCGAGGGCAGGACCTGGGCTCAGCCTGGGTATCCTTGGCCCCTCTATGGCAATGAGGGC 360 

HCV1 C C 



UuGlyTrpAlaGIyTrpLeuLeuSerProArcGlySerArfProSerTrpGlyProAaa 
J7 361 TTGGGCTGGGCAGGATGGCTCCTGTCACCCCGCGGGTCTCGGCCTAGTTGG 420 
HCV1 GC G T T C CA 

Cy« Thr 

AspProArcArcArcStrArsAsaLtuGlyLysVallleAspThrLeuThrCysGlyPhe 
J7 421 GACCCCCGGCGTAGGTCGCGTAATTTGGGTAAGGTCATCGATACCCTTACATGCGGCTTC 480 

HCV1 C G 



FIGURE 6-1 
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AitA«pLcuM«tGlyTyrIleProLeuValClyAlaProLeuG:yGlyAlaAlaAr«Ala 
J7 481 GCCGACCTCATGGGGTACATTCCGCTTGTCGGCGCCCCCT~AGGCGCC^ 540 

HCVl AC TCT A 



LauAlafllsGly 
J7 541 CTGGCACATGGT 552 

HCVl G C 781 



FIGURE 6-2 
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ProUuValClyAlaProLeuGlyGlyAlaAiaArfAlaLeuAlaHlsGlyValAr*Val 
J I 1 TCCGCTCCTCGC^GCCCCCTTAGGGGGCGCTGCCAGGGCCCTGGCACATGGTGTCCGCGTT 61 

HCV1 A TC T A G C 



LcuGluAspGlyValAsnTyrAlaThrGlyAsnLeuProGlyCysSerPheSerllePhft 
Jl 62 CTGGAGGACGGCGTGAACTATGCAACAGGGAATTTGCCCGGTTGCTCTTTCTCTATCTTC 121 
HCV1 A CC T T 



J I 122 
HCVl 



Jl 182 

HCVl 



LeuLeuAlaLeuLeuSerCysLtuThrlleProAlaSerAlaTyrGluValArcAsnVal 
CTCTTGGCTCTGCTGTCCTGTTTGACCATCCCAGCTTCCGCTTATGAAGTGCGCAACGTG 181 
TC C C T C TG G C G C CC TCC 

Val Gin Ser 

♦•• 

SarGlylleTyrHisValThrAsnAjpCysSerAsnSerSerlleValTyrGluAlaAia 
TCCGGGATATACC\TGTCACAMCGACTGCTCCAACTCAAGCATTCTGTATGAGGCGGCG 24 1 
A OCT C CTTCT GT C C 

T&r Leu pro 



AspVallleMetHlsAlaProCiyCytValProCysValArfGluAsnAsnSerSerArc 
J I 242 GACGTGATCATGCATGCCCCCGGCTGCGTGCCCTGCGTTCGGGAGAACAATTCCTCCCGT 301 
HCVl TCCC CA T G CT TGGCG GAG 

Ala Leu Thr Gly Ala 

**# 

CysTrpValMaUuThrProThrUuAiaAlaArcAanMaSerValProTbrThrTftr 
Jl 302 TGCTGGCTAGCGCTCACTCCCACGCTCGCGGCCAGGAATGCCAGCGTCCCCACTACGACA 361 
HCVl T GAGCTGGCA GGAAC G G GAG 

let val Thr AapGlyUysLeu Ala Gin 

••• 

UuArcArtflaValAapLeuUuValciyThrAlaAl^eCysSerAlaMetTyrVal 

Jl 362 TTACSACCCCACGTCGAC I'HiC KUl 1 0G CACGGCI O C mi l'UC 1LCGCTATGTACGTG 421 
HCVl CT TA TC TC GC CA CC TGCCC 

Il« Ser ThrLeu Leu 



Jl 422 

HCVl 



GlyAspLeuCyaGlyScrValPhcLeuIleScrGlnLeuPheThrPheSerProArgArc 
GGGCATCTCTGCCCAT l 1 0 1 1 1 1 CCTCATCTCCCAG C T G T r CA CCTTCTCGCCTCGCCGG 481 
CA G CTTGGGA TCAGC 

ValGly 



FIGURE 7-1 
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* • • • * • 

HliCluThrValGlnAspCysAsnCysStrlleTyrProGlyHlsValSerGlyHHArf 
J 1 482 CATGACACAGTACAGCACTGCAACTCCTCAATCTATCCCGGCCACCTATCAGCCCATCGC 34 1 
HCV1 CTG GACG AGT T T TAAGTC 

Trp Thr Gly n«Thr 

MetAlaTrpAspMet.Vta«tA*oTrpStrProThrAla 
Jl 342 ATGGCTTGGGATATGATGATGAACTGGTCGCCCACGGCA 380 

HCV1 A C T A G 

Thr 



FIGURE 7-2 
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* 

VsnTrpSerProThrAla 

.11 1 \ACTGGTCGCCCACGGCA 18 

HCVL c T AG 

Thr 



AlaLeuVaiValSerGlnLeuLeuArflleProGlnAlaVaiaetAspMetValAlaGly 
Jl 19 GCCTTAGTGGTGTCGCAGTTACTCCGGATCCCACAAGCTGTCATGGACATGGTGGCGGGG 78 

HCV1 . G G AA G T C G at ACTT 

HetAla tleLeu He 

• • * > • 

AlaHlsTrpGlyValLeuAlaGlyLeuAlaTyrTyrScrSetValGlyAanTrpAlaLjrs 
Jl 79 GCCCACTGGGGAGTCCTAGCGGGCCTTGCCTACTATTCCATGGTGGGGAACTGGGCTAAG 138 
HCV1 T GAA GTTC G 

lie Phe 



V alLe uIleVaLnetLeuLeuPheAlaGlyValAspGlyHlsThrAfgValThrGlyGly 
Jl 139 GTTTTGATTGTGATGCTACTCTTTWCGGCGTTGACGGGCATACCCCCGTGA 198 
HCV1 CC G A C G A C C G A A C C A 

Val Leu AlaGlu His 

• • • • ■ * 
ValGlnGlyHlsValThrSerTbrLeuThrSerLeuPheArcProGlyAlaSerGlnLys 

Jl 199 GTGCAAGGCCACGTCACCTCTACACTCACGTCCCTCTTTAGACCTGGGGCGTCCCAGAAA 258 
HCV1 AGTGCC ACTGTG GG T TGTTAG C CGC AC CAAG C 

SerAla ThrVal GlyPheVal LeuAla Lys Asa 

•••••• •••••• 

• • • • » ■ 
IleGln LcuVa lAanThrAsnGlySerTrpHlsIleAanArfThrAlaLeuAanCysAtn 

Jl 259 ATTCAGL 1IU 1 AAACACCMTGGCAGTTtX^TATCAACAGGACTGCCCTGAACTGCAAT 313 
HCV1 G C GA C C CCT CG 

Val He Leu Ser 

• • • • 
AspSerLeuGlnTtirGlyPtteLeuAlaAlaLeu 

Jl 3 19 GACTCCCTCCAAACTGGGTTCCTTGCCGCGCTG 351 
HCV1 - " TAG ACCCGGTGAG T 

Asa Trp Gly 



FIGURE 8 
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7 . . S#rV4lli#AapCy»AanThrCyaValThrCXnThrVai 

HCV 1 - ^ ctcaotoatcgactot aacacatgtotcactc agacgctc 

HCV1 WCtAt4CCfCCf*Cttcr* G A etc c ^ 

• A3 pPht 3trL<uA»pProThrybeThrIliGXuThrThrThrVftlProGinAapAl&v*i 

nrvi 41 ^^^^ccttggatcccaccttcaccatccagacgacgaccgtgccccaagatgcggtT 

fiCV1 CTCT T ATCGCC 0 TC 

III 

„. , - i ftr ^riThrftlnArtArcni jArtThrOlyArtGlyArjirrGlylltTyrirjPhtVij 

nrv i 101 TCGCGCACGCAGCGCCGAGGTAGGACXQGCACGGGCAGGAGAGGCATCTATAGGTTTG7C 
HCV1 C TATOC GACC CA 

T^^'^lTOluArrProsirAlaMstPhiAtpStrS^rvtiLtttCytGiucyiTyrA^p 

*\ 161 ACTCCAGGAGAACGGCCCTCGGCGATGTTCGATrCTTCGGTCCTATGTGAGTGTTATGAC 

CV1 OAoaac c oc coc c c 

Alt oiy 

Jl 221 GCGGGCTGTGCTTGGTATGAGCTCACGCCCGCTGAGACCTCGGTTAGCTTGCGGGCTTAC' 
HCV1 A CTAACAAG 

Thr 

t* ^^»2ThrProClyL«uPfoValCyfGlnAtpHi»LeuGXuPhmpGiuS«rV«lPhc 

W/ £A 381 CT>^TACACCAGGGTTGCCCGTCTGCCAOGACCATCTGGAGTTCTOGGAOAGCCrrCTT 
HCYl AOCCOCT G TAT 0 f 

Mat gij • 

^ rGl ^ u ^^l»Il«A»pAlrtIsl^iL«uS«pGlhThpLyiGlnAiaGlyA3pAsn 
«rvi 41 ACATOC CTCACCCACATAQACqCCCA C TTCTT Q T C CCAGACTAAGCACGCAGGAGACAAC 
*** V1 TT T TC A A AGTGG 

S«r Glu 

£^££^£LwYalAlaTyrOlnAlaT^^ 

Jl 401 ttcccctacctogtaocataccaagccacagtotgcgccaoggctaaggctccacctcca 

*CVl CT T 0 c T CACTG 

gfrTrpAap01aMampLy»C»»t^nllaArgLett^ 
Jl 461 TCOTGGOATCAAATTWOMOTTICrCATACaGCTAAAGCCTACGCTGCAGGGGCCAACC 



HCV1 



OQ TflTCC CCCT A 



yroLattUttTyrAraLattOlyAlaValGlnAtnGluVfclTtoLeuThrfllaProxleTar 
-JIt 521 CCCCTGCTGTATAGGCTAGGAGCCGTCCAGAATflAGGT caccctcac acaccctataac c 
HCV1 ACAOCTT AA 00 AG C 

Xla Val 

66X AAA 

HCVt taeateatfacatacatrtc 

FIGURE 9 
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LtuThr 
Jl I CCTCACC T 

HCV1 

.\rfAspProThrValProLeuAiaArcAlaAlaTrp61urtirAlaArtfisThrProVai 
J I a CCTGACCCCACCCTCCCCCTTGCGCGGGCTGCGTOCCACACAGCTAGACACACTCCAGTC 6? 
HCVl . T AAC C A A A 

Thr 

AsnSerTrpLeuGlyAsnllelleMetTyrAlaProThrLeuTrpAiaArsftecIleLeu 
Jl 68 AACTCCTGGCTAGGCAACATCATCATGTATGCGCCCA Cl 'n Xi IG GCCAAGGATGATTCTG 127 

HCVl T A T C AC G A 

Phe 



MetThrHlsPhePheSerlleLeuLeuAi&GlnGluGInLettGluLysAlaLeuAspCys 
Jl 128 ATOACTCA C TT C nCTCCATCCTTCTAGCCCAGGAGCAACTTGAAAAAGCCCTGGATTOT 137 
HCVl c T TAG G A AG C 5 C G C C 

Val He ArgAsp Gin 

GlnlleTyrGlyAlaCysTyrSerlleGluProLeuAspLeuProGlnllelleGluArr 
Jl 188 CAAATCTACGGGGCCTGTTACTCCATTGAGCCACTTGACCTACCTCAGATCATTGAACGA 247 

HCVl GG c * A T CACA 

Glu Pro Gin 

LeuHlsGlyLeuSerAlaPheSerLeuHlsSerTyrSerProGlyGluIleAsnArrVal 
J 1 248 CTCCATGGTCTTAGCGCATTTTtACTCCATAGTrACTCTCCAGGTGAGATCAATAGGGTG 307 
HCVl C C C AT 

• • • • • * 
AlaSerCysLauArsLysLeuGlyValProProLeuArcValTrpArcfilsArcAlaArff 

Jl 308 GCTTCATGCCTCAGGAAGCTTGGGGTACCACCCTTGCGAGTCTGGAGACATC 367 

HCVl 05 AA G CT C'CG 

Ala Ala 

• ••••* 
- ' S«rValArcAlaIjrsLeuLeuSerClnGlyClyArtAlaAlaTtirLyiGlyLysTyTL«tt 

Jl 368 AGTGTCCGCGCTAAGCTACTGTCCCAAGGGGGCAGGGCCGCCACTTGT^ 427 
HCVl c GTGAGACTTA 

Arc AlaArr 11« 

FIGURE 10 
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HCV-1 



6 7 GCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
CGCAGATCGGTACCGCAATCATACTCACAGCACGTCGGAGGTCCTGGGGGGGAGGGCCCT 

0 7 GAGCCATAGTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGACCGGGTCCT 
CTCGGTATCACCAGACGCCTTGGCCACTCATGTGGCCTTAACGGTCCTGCTGGCCCAGGA 

4 7 TTCTTGGATCAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGCTAK 
AAGAACCTAGTTGGGCGAGTTACGGACCTCTAAACCCGCACGGGGGCGTTCTGACGATCG 

8 7 CGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGC 
GCTCATCACAACCCAGCGCTTTCCGGAACACCATGACGGACTATCCCACGAACGCTCACG 

-27 CCCGGGAGGTCTCGTAGACCGTGCACC -1 
GGGCCCTCCAGAGCATCTGGCACGTGG 

MetSerThrAsnProLysProGlnLysLysAsnLysArgAsnThrAsnArgArgProGln 
1 ATGAGCACGAATCCTAAACCTCAAAAAAAAAACAAACGTAACACCAACCGTCGCCCACAG 
TACTCGTGCTTAGGATTTGGAGTTTTTTTTTTGTTTGCATTGTGGTTGGCAGCG 

AspValLysPheProGlyGlyGlyGlnlleValGlyGlyValTyrLeuLeuProArgArg 

6 1 GACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGTTTACTTGT^ 

CTGCAGTTCAAGGGCCCACCGCCAGTCTAGCAACCACCTCAAATGAACAACGGCGCGTCC 

GlyProArgLeuGlyValArgAlaThrArgLysThrSerGluArgSerGlnProArgGly 
121 GGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGAGCGGTCGCAACCTCGAGGT 
CCGGGATCTAACCCACACGCGCGCTGCTCTTTCTGAAGGCTCGCCAGCGTTGGAGCTCCA 

ArgArgGlnProIleProLysAlaArgArgProGluGlyArgThrTrpAlaGlnProGly 
181 AGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAGGACCTGGGCTCAGCCCGGG 
TCTGCAGTCGGATAGGGGTTCCGAGCAGCCGGGCTCCCGTCCTGGACCCGAGTCGGGCCC 

TyrProTrpProLeuTyrGlyAsnGluGlyCysGlyTrpAlaGlyTrpLeuLeuSerPro 
Z 4 1 TACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGCGGGATGGCTCCTGTCTCCC 
ATGGGAACCGGGGAGATACCGTTACTCCCGACGCCCACCCGCCCTACCGAGGACAGAGGG 

ArgGlySerArgProSerTrpGlyProThrAspProArgArgArgSerArgAs^euGly 
3 0 1 CGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCCCCGGCGTAGGTCGCGCAATTTGGGT 
GCACCGAGAGCCGGATCGACCCCGGGGTGTCTGGGGGCCGQ^TCCAGCGCGTTAAACCCA 

LysVallleAspThrLeuThrCysGlyPheAlaAspLeuMetGlyTyrlleProLeuVal 
3 6 1 AAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCATGGGGTACATACCGCTCGTC 
TTCCAGTMCTATGGGAATGCACGCCGAAGCGGCTGGAGTACCCCATGTATGGCGAGCAG 

GlyAlaProLeuGlyGlyAlaAlaArgAlaLeuAlaHisGlyValArgValLeuGluAsp 
421 GGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCGGGTTCTGGAAGAC 
CCGCGGGGAGAACCTCCGCGACGGTCCCGGGACCGCGTACCGCAGGCCCAAGACCTTCTG 

GlyValAsnTyrAlaThrGlyAsnLeuProGlyCysSerPheSerllePheLeuLeuAla 
481 GGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTTCTCTATCTTCCTTCT 

CCGCACTTGATACGTTGTCCCTTGGAAGGACCAACGAGAAAGAGATAGAAGGAAGACCGG 
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LeulAuSerCyaLeuThrValProAlaSerAlaTyrGlnValArcAsnSerThrGlyLeu 
541 C7GCTCTCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGTGCGCAACTCCACGGGGCTT 
GACGAGAGAACGAACTGAGACGGGCGAAGCCGGATGGTTCACGCGTTGAGGTGCCCCGAA 

TyrHisValThrAsnAspCysProAsnSerSerlleValTTrGluAlaAlaAspAlalle 

601 TACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTACGAGGC3GCCGATGCCATC 
ATGGTGCAGTGGTTACTAACGGGATTGAGCTCATAACACATGCTCCGCCGGCTACGGTAG 

LeuHisThrProGlyCysValProCysValArgGluGlyAsnAlaSerArgCysTrpVal 
661 CTGCACACTCCGGG6TGCGTCCCTTGCGTTCGTGAGG6CAACGCCTCGAGGTGTTGGGTC 
GACGTGTGAGGCCCCACGCAGGGAACGCAAGCACTCCCGTTGCGGAGCTCCACAACCCAC 

AlaiMetThrProThrValAlaThrArgAspGlyLysLeuProAlaThrGlnLeuArgArg 
721 GCGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCCCGCGACGCAGCTTCGACGT 
CGCTACTGGGGATGCCACCGGTGGTCCCTACCGTTTGAGGGGCGCTGCGTCGAAGCTGCA 

HisIleAspLeuLeuValGlySerAlaThrLeuCysSerAlaLeuTyrValGlyAspLeu 
781 CACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGCCCTCTACGTGGGGGACCTA 
GTG T AGCTAGACGAACAGCCCTCGCGGTGGG AGACAAGCCGGGAGATGCACCCC CTGGAT 

CysGlySerValPheLeuValGlvGlnLeuPheThrPheSerProArcrArgHisTrpThr 

i 4 1 TGCGGGTCTGTCTTTGTTGTCGGCCAACTGTTCACCTTCTCTCCCAGGCGCCACTGGACG 
ACGCCCAGACAGAAAGAACAGCCGGTTGACAAGTGGAAGAGAGGGTCCGCGGTGACCTGC 

ThrGlnGlyCysAsnCysSerlleTyrProGlyHisIleThrGlyHisArgMetAlaTrp 
901 ACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAACGGGTCACCGCATGGCATGG 
TGCGTTCCAACGTTAACGAGATAGATAGGGCCGGTATATTGCCCAGTGGCGTACCGTACC 

AspMetMetMetAsnTrpSerProThrThrAlaLeuValMetAlaG InLeuLeuArg lie 
961 GATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAATGGCTCAGCTGCTCCGGATC 
CTATACTACTACTTGACCAGGGGATGCTGCCGCAACCATTACCGAGTCGACGAGGCCTAG 

ProGlnAlalleLeuAspMetlleAlaGlyAlaHisTrpGlyValLeuAlaGlylleAla 
1021 CCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTGGGGAGTCCTGGCGGGCATAGCG 
GGTGTTCGGTAGAACCTGTACTAGCGACCACGAGTGACCCCTCAGGACCGCCCGTATCGC 

TvrPheSerMetValGlyAsnTrpAlaLysValLeuValValLeuLeuLeuPheAlaGly 
1 : 3 1 IATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGTGCTGCTGCTATTTGCCGGC 
ATAAAGAGGTACCACCCCTTGACCCGCTTCCAGGACCATCACGACGACGATAAACGGCCG 

ValAspAlaGluThrHisValThrGlyGlySerAlaGlyHisThrValSerGlyPheVal 
1141 GTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGGCCACACTGTGTCTGGATTTGTT 
CAGCTGCGCCTTTGGGTGC^TGIGCCCCCTTCACGGCCGGTGTGACACAGACCTAAACAA 

SerLeuLeuAlaProGlyAlaLysGltiAsnValGlnLeuIleAsnThrAsnGlySerTrp 
1201 AGCCTCCTCGCACCAGGOGCCAAGCAGAACGTCCAGCTGATCAACACCAACGGCAGTTGG 
TCGGAGGAGCGTGGTCCGCGGTTCGTCTTGCAGGTCGACTAGTTGTGGTTGCCGTCAACC 

HisI^uAsnSerThrAlaLexiAsnCysAsnAspSerLeuAsnThrGlyTrpLeuAlaGly 
1261 CACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCTCAACACCGGCTGGTTGGCAGGG 
GTGGAGTTATCGTGCCGGGACTTGACGTTACTATCGGAGTTGTGGCCGACCAACCGTCCC 

LeuPheTyrHlsHisLysPheAsnSerSerGlyCysProGluArgLeuAlaSerCysArg 
1321 CTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCCTGAGAGGCTAGCCAGCTGCCGA 
GAAAAGATAGTGGTGTTCAAGTTCAGAAGTCCGACAGGACTCTCCGATCGGTCGACGGCT 
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ProI^uThrAspPheAspGlnGlyTrpGlyProIleSerTTrAicAsnGlySerGlyPro 
^ 381 CCCCTTACCGATT TTG ACCAGGGCTGGGGCCCTATCAGTTATGCC1AACGGAAGCGGCCCC 
GGGGAATGGCTAAAACTGGTCCCGACCCCGGGATAGTCAATACGC'tTTGCCTTCGCCGGGG 

AspGlnArgProTyrCysTrpHisTyrProProLysProCysGL ,f lleValProAlaLys 
1441 GACCAGCGCCCCTACTGCTGGCACTACCCCCCAAAACCTTGCGGUATTGTGCCCGCGAAG 
CTGGTCGCGGGGATGACGACCGTGATGGGGGGTTTTGGAACGCCATAACACGGGCGCTTC 

SerValCysGlyProValTyrCysPheThrProSerProValVaiValGlyThrThrAsp 
1501 AGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGTGGTtjGTGG GAAC GACCGAC 
TCAGACACACCAGGCCATATAACGAAGTGAGGGTCGGGGCACCA2CAC^ 

ArgSerGlyAlaProThrTyrSerTrpGlyGluAsnAapThrAspvalPheValLeuAsn 

1561 AGG TCGGGCGCGCC CACCTACAGCTGGGGTGAAAATGATACGGACGTCTTCGTCCTTAAC 
TCCAGCCCGCGCGGGTGGATGTCGACCCCACTrrTACTATGCCTGCAGAAGCAGGAATTG 

AsnThrArgProProLeuGlyAsnTrpPheGlyCysThrTrpMetAsnSerT^ 
1621 AATACCAGGCCACCGCTGGGCAATTGGTTCGGTTGTACCTGGATGAACTCAACTGGATTC 
TTATGGTCCGGTGGCGACCCGTTAACCAAGCCAACATGGACCTACTTGAGTTGACCTAAG 

ThrLysValCysGlyAlaProProCysVallleGlyGlyAlaGlyAsnAsnThrLeuHis 
.581 ACCAAAGTGTGCGGAGCGCCTCCTTGTGTCATCGGAGGGGCGGGCAACAACACCCTGCAC 
TGGTTTCACACGCCTCGCGGAGGAACACAGTAGCCTCCCCGCCCGTTGTTGTGGGACGTG 

CysProThrAspCysPheArgLysHisProAspAlaThrTyrSerArgCysGlySerGly 
1741 TGCCCCACTGATTGCTTCCGCAAGCATCCGGACGCCACATACTCTCGGTGCGGCTCCGGT 
ACGGGGTGACTAACGAAGGCGXTCGTAGGCCTGCGGTGTATGAGAGCCACGCCGAGGCCA 

ProTrpIleThrProArgCysLeuVaiAspTyrProTyrArgLeuTrpHisTyrProCys 
1801 CCCTGGATCACACCCAGGTGCCTGGTCGACTACCCGTATAGGCTTTGGCATTATCCTTGT 
GGGACCTAGTGTGGGTCCACGGACCAGCTGATGGGCATATCCGAAACCGTAATAGGAACA 

ThrlleAsnTyrThrllePheLysIleArgMetTyrValGlyGlyValGluHisArgLeu 
1361 ACCATCAACTACACCATATTTAAAATCAGGATGTACGTGGGAGGGGTCGAACACAGGCTG 
TGGTAGTTGATGTGGTATAAATTTTAGTCCTACATGCACCCTCCCCAGCTTGTGTCCGAC 

GluAlaAlaCysAsnTrpThrArgGlyGluArgCysAspLeuGluAspArgAspArgSer 
1221 GAAGCTGCCTGCAACTGGACGCGGGGCGAACGTTGCGATCTGGAAGACAGGGACAGGTCC 
CTTCGACGGACGTTGACCTGCGCCCCGCTTGCAACGCTAGACCTTCTGTCCCTGTCCAGG 

GluLeuSerProLeuI^ulAuThrThrThrGlnTrpGlnValLeuProCysSerPheThr 
1981 GAGCTCAGCCCGTTACTGCTGACCACTACACAGTGGCAGGTCCTCCCGTGTTCCTTCACA 
CTCGAGTCGGGCAATGACGACTGGTGATGTGTCACCGTCCAGGAGGGCACAAGGAAGTGT 

ThrLeuProAlaLeuSerthrGlyl^uIleHisI^uHisGlnAsnlleValAspValGln 
2041 ACCCTACCAGCCTTGTCCACCGGCCTCATCCACCTCCACCAGAACATTGTGGACGTGCAG 
TGGGATGGTCGGAACAGGTGGCCGGAGTAGGTGGAGGTGGTCTTGTAACACCTGCACGTC 

TyrLeuTyrGlyValGlySerSerlleAlaSertrpAlalleLysTrpGluTyrValVal 
2101 TACTTGTACGGGGTGGGGTCAAGCATCGCGTCCTGGGCCATTAAGTGGGAGTACGTCGTT 
ATGAACATGCCCCACCCCAGTTCGTAGCGCAGGACCCGGTAATTCACCCTCATGCAGCAA 

LeuI^uPhel^uI^uLeuAiaAspAlaArgValCysSerCysI^uTr^etMetLeuLeu 
2161 CTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTGCrTGTGGATGATGCTACTC 
GAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACGAACACCTACTACGATGAG 
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1221 



2341 



2401 



2461 



2581 



IleSerGlnAlaGluMaAlal^uGluAsnlAuVallleLeuAsnAlaAlaSerLeuAla 
ATATCCCAAGCGGAGK 

rATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGAATTACGTCGTAGGGACCGG 



Giy^^Hi^lyl^uValSerPheLeuValPhePheC^sPheAlaTrpTyrLeiiLysGly 
^281 GGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTTTGCATGGTATTTGAAGGGT 
CCCTGCGIX^CAGAACATAGGAAGGAGCACAAGAAGACGAAACGTACCATAAACTTCCG^ 

ffiBEZ— yA1 aValT 7rThr PheTyrGlyMe tTrp P roLei iT ,eu LeuLeuLeu 
TTCACCCACGGGCCTCGCCAGATGTGGAAGATGCCCTACACCGGAGAGGAGGACGAG 

AACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCACCGGCGCAGCACACCGCCA 
GTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCAT^ 

CAACAAGAGCAGCCCAACTACCGCG ACTGAGACAGTGG TATAATGTTCGCGATATAGTCG 

TrpCys^TrpT^l^uGlnTyrPheLeuThrArqValGluAlaGlnLeuHisValTrD 
-GGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGAAGCGCAACTGCACG 
ACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCTTCGCGTTGACGTGCACACC 

^CCCCCCT^GTCCGAGO*^ 

TAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATGAGTACACACGACAT 

2641 CACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGCCGTCTTCGGACCCCTTTC 
GTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGGGAAACC 

Ilel^uGlnAlaSerl^uI^uLysValProTyrPheValArgValGlnGlyr^uI^u^ 
2/01 ATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCGCGTCCAAGGCCTTCTCCGG 
TAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAAGAGGCC 

. - ^^^^^i^LysMetlleGlyGlyHisTyrValGlnMetValllelleLys 
-■ Gi F I^^n^5S GGAAGATG A TCGGAGGCC^TTACGTGa\AATGGTCATCATTAAG 
AAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAGTAATTC 

~euGlyMa^T^ 

2821 TTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCACTCCTCTTCGGGACTGGGCG 
AATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTGACCCGC 

HlsAsnGlyLeuArgAspLeuAlaValAlaValGluProValValPheSerGlnMetGlu 
2881 CACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAAATGGAG 
GTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTTTACCTC 

i I5Sty2i£if ^^^iy^^PThrAlaAlaCysGlyAspIlelleAsnGlyLeu 
2941 ACCAAGCTCATCACGTGGGGGGCAGATACCGCCGCGTGCGGTGACATCATCAACGGCTTG 
TGGTTCGAGTAGTGCACCCCCCGTCTATGGCGGCGCACGCCACTGTAGTAGTTGCCGAAC 

3001 CCTGTTTCCGCCCGCAGGGGCCGGGAGATACTGCTCGGGCCAGCCGATGGAATGGTCTCC 
GGACAAAGGCGGGCGTCCCCGGCCCTCTATGACGAGCCCGGTCGGCTACCTTACCAGAGG 



FIGURE 12-4 



52 



EP 0 419 182 B1 



LysGlyTrpArgLeuLeuAlaProIleThrAlaTyrAlaGlnGLaThrArgGlyLeuLeu 
1061 AAGGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCGCCCTA 
TTCCCCACCTCCAACGACCGCGGGTAGTGCCGCATGCGGGTCGT CTGTTCCCCGGAGGAT 

GlyCysIlelleThrSerLeuThrGlyArgAspLysAsnGlnValGluGlyGluValGin 
3121 GGGTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCAAGTGGAGGGTGAGGTCCAG 
CCCACGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGTTCACC'TCCCACTCCAGGTC 

IleValSerThrAlaAlaGlnThrPheLeuAlaThrCyslleAsnGlyValCysTrpThr 

- 18 1 ATTGTGTCAACXGCTGCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGGACT 
TAACACAGTTGACGACGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACCTGA 

valTyrHisGlyAlaGlyThrArgThrlleAlaSerProLysGlyProVallleGlnMet 
3 24 1 GTCTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAAGGGTCCTGTCATCCAGATG 
CAGATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTCCC^GACAGTAGGTCTAC 

TyrThrAanValAspGlnAspLeuValGlyTrpProAlaProGlnGlySerArg 
3301 TATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAGGTAGCCGCTCATTG 
ATATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGGCGTTCCATCGGCGAGTAAC 

ThrProCysThrCysGlySerSerAspLeuTyrLeuValThrArgKisAlaAspVallle 
361 ACACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACGCCGATGTCAXT 
TGTGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGTGCGGCTACAGTAA 

ProValArgArgArgGlyAspSerArgGlySerLeuLeuSerProArgProIleSerTyr 
3421 CCCGTGCGCCGGCGGGGTG ATAGCAGGGGCAGCCTGCTGTCGCCCCGGCCCATTTCCT AC 
GGGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAGCGGGGCCGGGTAAAGGATG 

LeuLysGlySerSerGlyGlyProLeuLeuCysProAlaGlyHisAlaValGlyllePhe 
3481 TTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACGCCGTGGGCATATTT 
AACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCCCGTGCGGCACCCGTATAAA 

ArgAlaAlaValCysThrArgGlyValAlaLysAlaValAapPhelleProValGluAsn 
3541 AGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGACTTTATCCCTGTGGAGAAC 
TCCCGGCGCCACACGTGGGCACCTCACCGATTCCGCCACCTGAAATAGGGACACCTCTTG 

LeuGluThrThrMetArgSerProValPheThrAspAsnSer Ser P roProValValPro 
3601 CTAGAG ACAACCATGAGGTCCCCGGTGTTCACGGATAACTCCTCTCCACCAGTAGTGCCC 
GATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGGAGAGGTGGTCATCACGGG 

GlnSerPheGlnValMaHisLeuHisAlaProThrGlySerGlyLysSerThrLysVal 
j 6 6 1 CAGAGCTTCCAGGTGGCTCACCTCCATGCTCCCACAGGCAGCGGCAAAAGCACCAAGGTC 
GTCTCGAAGGTCCACCGAGTGGAGGTACGAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAG 

- ProAlaAlaTyrAlaAlaGlnGlyTyrLysValLeuValLeuAsnP roSerValAlaAla 
3721 CCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACTCAACCCCTCTGTTGCTGCA 
GGCCGACGTATACGTCGAGTCCCGATATTCCACGATCATGAGTTGGGGAGACAACGACGT 

ThrLeuGlyPheGlyAlaTyrMetSerLys AlaHisGly I leAspP roAsn IleArgThr 
3781 AQVCTGGGCTTTGGTGCTTACATGTCCAAGGCTGVTGGGATCGATCCTAACATCAGGAC 
TGTGACCCGAAACCACGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGG 

GlyValArgThrlleThrThrGlySerProIle 
3841 GGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTCCACCTACGGCAAGTTCCTT 

CCCCACTCTTGTTAATGGTGACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGGAA 
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CGGCTGCCGCCCACGAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGG 

, QC . ^^^a^u^Ile^uGlylleGlyThrValLeuAapGlnAlaGluThrAlaGly 
j961 ^^^i^TCaTCTTGG<K»TCGGCACTGTCCTTGACCAM 

rGCCTACGGTGTAGGTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTCTGACGCCCC 

,021 ^^S^S?^SSSSgSS5SS?SS^ 

CGCTCTGACCAACACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGG 

TTGTAGCTCCTCCAAOIAGAC^GGIGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATM 

4141 CCCCTQMAGTAATCAAG<K3G6GGA<MCATCTCATCTTCTGTCATTCAAAGAA(U^TGC 
GGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACG 

As^luI^uAlaAlaLysI^uVaiAlal^uGlylleAarjaaValAlaTvrtVrAroGlv 
V.TGCTTGAGCGGCGTTTOGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCA 

4261 s™i~£^^ 

GAACTGCACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAG 

^J^iVl^^lyAspPheAspSexVallleAspCysAsnThrCysValThrGln 
4321 *^£^ATACCGOX»CTTCGACT^ 

TACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGACGTTATGCACACAGTGGGTC 

4381 ™TS~?T?^S7F^ CCT ^ CT ^ C ^ 

TGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTA 

4441 ^^^^^sssEssgffliassaasaBa 

v-GaCAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCT 

4501 rFFST^^ c ^ G ^ CGCCC ^ a ^ 

AAACACCGTGGCCCCCTCGCGGGGAGGCCGTACAAGCTGAGCMX^ 
ATACTGCGTCCGACACGAACCATACTCGAGTGCGGGCGGCTCTGATGTCAATCCGATGCT 

4621 

ValPheThrGlyLeuThrHisIleAspAlaHisPheLeuSerGlnThrLysGlnSerGly 
4681 GTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGG 
CAGAAATGTCCGGAGTGAGTATATCTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCC 
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GluAsnLeuProTyrl^uVaLAlaTyrG 
4741 GAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTGCGCT^.GGGCTCAAGCCCCT 

CTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACACGC SATCCCGAGTTCGGGGA 

ProProSertrpAspGlnMetTrpLysC7sI^uIleArgI^uLysi , roThrLeuHisGly 
4801 CCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGG 
GGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCC 

ProThrProI^uI^uTyrArgLeuGlyAlaValGlnAsnGluIleThrLeuThrHisPro 
4861 CCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAAXGAAATG^CCCTGACGCACCCA 
GGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTACTTO 

ValThrLysTyrileMetThrCysMetSerAla^ 

4921 GTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGG 
CAGTGGTTTATGTAGTACTGTACGTACAGC^ 

ValLeuValGlyGlyVall^uAlaJUal^uAlaM^ 
4981 GTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTG 
CACGAGCAACCGCCGCAGGACCGACGAAACCGGCGCATAACGGACAGTTGTCCGACGCAC 

VallleValGlyArgValValLeuSerGlyLysProAlallelleProAspArgGluVal 
5041 GTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTC 
CAGTAICACCCGTCCCAGC^GAACAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAG 

LeuTyrArgGluPheAspGluMetGluGluCysSerGlnHisLeiiProTyrlleGluGIn 
5101 CTCTACCGAGAGTTCG ATG AGATGGAAGAGTGCTCTCAGCACTTACCG TACATCG AGCAA 
GAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTT 

GlyMet^etl^uAlaGluGlnPheLysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSer 
5161 GGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTCCAGACCGCGTCC 
CCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGAOGTCTGGCGCAGG 

ArgGlnAlaGluVallleAlaProAlaValGlnThrAsnTrpGlnLysLeuGluThrPhe 
5221 CGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTC 
GCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAG 

TrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyrLeuAlaGlyLeuSerThr 
: Z 8 1 rGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATACTTGGCGGGCTTGTCAACG 
ACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGC 

LeuProGlyAsnProAlalleAlaSerLeuMetAlaPheThrAlaAlaValThrSerPro 
5341 CTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCA 
GACGGACCATTGGGGCGGTAACGAAGTAACXACCGAAAATGTCGACGACAGTGGTCGGGT 

LeuThrThrSerGlnThrLeuLeuPheAanlleLeuGlyGlyTrpValMaAlaG 
5401 CTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGGGTGGGTGGCTGCCCAGCTC 
GATTGGTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCCCACCCACCGACGGGTCGAG 

AlaAlaProGlyAlaAlaThrAlaPheValGlyAlaGlyLeuAlaGlyAlaAlalleGly 
5461 GCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTAGCTGGCGCCGCCATCGGC 
CGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAATCGACCGCGGCGGTAGCCG 

SerValGlyLeuGlyLysVall^uIleAspIlel^iiAlaGlyTyrGlyAlaGlyValAla 
5521 AGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGGGTATGGCGCGGGCGTGGCG 
TCACAACCTGACCC CTTCCAGGAGTATCTGTAGGAACG TCCCATACCGCGCCCGCACCGC 
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5821 n^^^^^^S 

5541 ^^^^sssssssasss^sss^ssst 

ACCCTGACCTATACGCTCCACAACTCGCTGAAATTCT6GACCGATTTTCGATTCGAGTAC 

8001 ^^S^gsssssssssssssssssssw 

GGTGTCGACGGACCCTAGGGGAAACACAGGAOGGTCGCGCCCATATTCCCCCAGACCGCT 
CACCTGCCGTAGTACGTGTGAGCGACGGTGACACCTOTACTCTAGTGACCTGTACAGTTT 
TTGCCCTGCTACTCCTAGCAGCCAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAG 
GGGTAATTACGGATGTGGTCCCCGGGGACATGGGGGGAAGGACGCGGCTTGATGTGC^ 

^ G ^?5£££^ TCT ^^ 

CGCGATACCTCCCACAGACGTCTCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTG 

6301 TACGTGACGGGTATGACTACTGACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAA 
ATGCACTGCCCATACTGATGACTGTTAGAGTTTAOGGGCACGGTCCAGGGTAGCGGGCTT 

6361 TTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTTTGCGCCCCCCTGCAAGCCCTTC 
AAAAAGTGTCTTAACCTGCCCCACGCGGATGTATCCAAACGCGGGGGGACGTTCGGGAAC 
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6421 3SSSBffiKK833K8^^ 

- . « , SJ^5 GluProGluProA8 P ValAlaVa l^euThrSetttetLeuThrAsoPrQSerHia 
GGAACGCTCGG<Xa!T<^CTG<»CC(WCACi^T«y^ 

6601 SSSSSSSSSS^^ 

AGGAGCCGATCGGTCGATAGGCGAKTAGAGAGTTCCGTTGAAOT 

6661 





6901 

TGCCCGCTGTTATGCTGTTGTAGGA6ACTCGGGCGGGGAAGACCGACGGGGGGGCTGATC 

7141 S^^^SP^^ssg^^^s 

CTGCGACTCAGGATAAGGAGGTACGGGGGGGACCTCCCCCTCGGACCCCTAGGCCTAGAA 
TCGCTGCCCAGTACCAGTTGCCAGTCATCACTCCGGTTGC^ 
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SezMetSerTTrSerTrpThrGlyAlaLeuValThrProCysAlaAlaGliiGluGlnLys 
7261 TCAATGTCTTACTCTTGGACAGGCGCACTC^ 

AGTTACAGAATGAGAACCTGTCCGCGTGAGCAGTGGGGCACGCGGCGCC^ 

LeuProIleAsnAlaLeuSerAsnSerLeuLeuArgHisHisAsnLeuValTTrSerThr 
"321 CTGCCCATCAATGCACTAAGCAACTCGTTGCTACGTCACCACAATTTGGTGTATTCCACC 
GACGGGTAGTTACGTGATTCGTTGAGCAACGATGCte 

ThrSerAr^SerAlaCysGlnArgGlJiLysLysValThrPheAspArgLeuGlnValXeu 
7381 ACCTCACGCAGTGCTTGCCAAAGGCAGAAGAAAGTCACATTTGA 
TGGAGTGCGTCACGAACGGTTTCCGTCTTCTTTCAGTC 

AspSerHisTyrGlnAspValLeuLysGluValLysAlaAlaAlaSerLysValLysAla 
7441 GACAGCCATTACCAGGACGTACTCAAGGAGGTTAAAGCAGCGGCGTCAAAAGTC 
CTGTCGGTAATGGTCCTGCATGAGTTCCTCCAATTTCGTCGCCGCAGTTTTCAC 

AsoLeuLeuSerValGluGluAlaCysSerLeuThrProProKisSefAlaLysSerLys 
7501 AACTTGCTATCCGTAGAGGAAGCTTGCAGCCTGACGCCCCCAGVCTCAGCCAAATCCAAG 
TTGAACGATAGGGATCTCCTTCGAACGTCGGACTGCGGGGGTC 

PheGlyTyrGlyAlaLysAspValArgCysHisAlaArgLyaAlaValThrHisIleAsn 
* 5 6 1 TTTGGTTATGGGGCAAAAGACGTCCGXTGCGATGCCAGAAAGGCCGTAACCCACATCAAC 
AAACCAATACCCCGTTTTCTGCAGGCAACGGTACGGTCTTTCCGGCATTGGGTGTAGTTG 

SerValTrpLysAspLeul^uGluAspAsnValThrProIleAspThrThrlleMetAla 
"621 TCCGTGTGGAAAGACCTTCTGGAAGACAATGTAACACCAATAGACACTACCATCATGGCT 
AGGCACACCTTTCTGGAAGACCTTCTGTTACATTGTGGTTATCTGTGATGGTAGTACCGA 

LyaAaaGluValPheCysValGlnProGluLysGlyGlyArgLysProAlaArgLemie 
7681 AAGAACGAGGTTTTCTGCGTTCAGCCTGAGAAGGGGGGTCGTAAGCCAGCTCGTCTCATC 
TTCTTGCTCCAAAAGACGCAAGTCGGACTCTTCCC^ 

ValPheProAspLeuGlyValArgValCysGltiLysMetAlaLeuTyrAspValValThr 

7741 GTGTTCCCCGATCTGGGCGTGCGCGTGTGCGAAAAGATGGCTTTGTACGACGTGGTTACA 
CACAAGGGGCTAGACCCGCACGCGCACACGCTTTTCTACCGAAACATGCTGCACCAATGT 

LysI^uProLeuAlaVaJJletGlySerSerTyrGlyPheGlnTyrSerProGlyGlaArg 
"301 AAGCTCCCCTTGGCCGTGATGGGAAGCTCCTACGGATTCCAATACTCACCAGGACAGCGG 
TTCGAGGGGAACCGGCACTACCCTTCGAGGATGCCTAAGGTTATGAGTGGTCCTGTCGCC 

ValGluPheLeuValGlnAlaTrpLyaSerLysLysThrPrcMetGlyPheSerTyrAsp 
7861 GTTGAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGAT 
CAACTTAAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTA 

ThrArgCyaPheAspSerThrValThrGluSerAspIleArgThrGluGluAlalleTyr 
7921 ACCCGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGGCAATCTAC 
TGGGCGACGAAACTGAGGTGTCAGTGACTCTCGCTGTAGGCATGCCTCCTCGSXTAGATG 

GlnCysCyaAspLeuAspProGlnAlaArgValAlalleLysSerLeuThrGluArgLeu 
7981 CAATGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAAGTCCCTCACCGAGAGGCTT 
GTTACAACACTGGAGCTGGGGGTTCGGGCGGACCGGTAGTTCAGGGAGTGGCTCTCCGAA 

TyrValGlyGlyProLeuThrAsnSerArgGlyGluAsnCysGlyTyrArgArgCysArg 
8041 TATGTTGGGGGCCCTCTTACCAATTCAAGGGGGGAGAACTGCGGCTATOGCAGGTGCCGC 
ATACAACCCCCGGGAGAATGGTTAAGTTCCCCCCTCTTGACGCCGATAGCGTCCACGGCG 
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AlaSerGlyVall^uThrThrSerCysGlyAsnThrLeuThxCysTyrlleliysAiaArg 
3 101 GCGAGCGGCGTACTGACAACTAGCTGTGGTAAUCCCTGVCTTGCTACATCAAGGCCC^ 
CGCTCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTGAACGATGTAGTTCCGGGCC 

AlaAlaCysArgAlaAlaGlyLeuGlnAspCysThrMetLeuValCysGlyAspAspLeu 

3161 GCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTGGCGACGACTTA 
CGTCGGAGAGCTCGGCGTCCCGAGGTCCTGAGjTGGTACGAGCACACACCGCTGCTGAAT 

ValVallleCysGluSerAiaGlyValGlnGluAspAlaAlaSerLeuArgAlaPheThr 
3221 GTCGTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGCGAGCCTGAGAGCCTTCACG 
CAGCAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCGCTCGGACTCTCGGAAGTGC 

GluAiaMetThrArgTyrSerAlaProProGlyAspProProGlnProGluTyrAspLeu 
3281 GAGGCTATGACCAGGTACTCCGCCCCCCCTGGGGACCCCCCACAACCAGAATACGACTTG 
CTCCGATACTGGTCCATGAGGCGGGGGGGACCCCTGGGGGGTGTTGGTCTTATGCTGAAC 

GluLeuIleThrSerCysSerSerAsnValSerValAlaHisAspGlyAlaGIyLysArg 
8341 GAGCTC ATAACATCATGCTCCTCCAACGTGTCAGTCGCCCACG ACGGCGCTGGAAAG AGG 
CTCGAGTATTGTAGTACGAGGAGGTTGCACAGTCAGCGGGTGCTGCCGCGACCTTTCTCC 

ValTyrTyrl^uThrArgAspProThrThrProLeuAlaArgAlaAlaTrpGluThrAla 

5401 GTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAGAGCTGQSTGGGAGACAGCA 
CAGATGATCGAGTGGGCACTGGGATGTTGGGGGGAGCGCTCTCGACGCACCCTCTGTCGT 

ArgHisThrProValAsnSerTrpLeuGlyAsnllelleMetPheAlaProThrLeuTrp 
8461 AGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCATGTTTGCCCCCACACTGTGG 
TCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTACAAACGGGGGTGTGACACC 

AlaArgMetlleLeuMetThrHisPhePheSerValLeuIleAlaArgAspGlnLeuGlu 
3521 GCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTATAGCCAGGGACCAGCTTGAA 
CGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATATCGGTCCCTGGTCGAACTT 

GlnAlaLeuAspCysGluIleTyrGlyAlaCysTyrSerlleGluProLeuAspLeuPro 
3581 CAGGCC CTCGATTGCGAGATCTACGGGGCCTGCTACTCCATAGAACCACTTGATCTACCT 
GTCCGGGAGCTAACGCTCTAGATGCCCCGGACGATGAGGTATCTTGGTGAACTAGATGGA 

ProIlelleGlnArgLeuHlsGlyLeuSerAlaPheSerLeuHisSerTyrSerProGlv 
3 641 CCAATCATTCAAAGACTCCATGGCCTCAGCGCATTTTCACTCCACAGTTACTCTCCAGGT 
GGTTAGTAAGTTTCTGAGGTACCGGAGTCGCGTAAAAGTGAGGTGTCAATGAGAGGTCCA 

GluIleAsnArgValAlaAlaCysLeiiArgLysLeuGlyValProProLeiiArgAlaTrp 
8701 GAAATTAATAGGGTXjGCCGCATGCCTCAGAAAACTTGGGGTACCGCCCTTGCGAGCTTGG 
CTTTAATTATCCG^CCGGCGTACGGAGTCTTTTGAACCCCATGGCGGGAACGCTCGAACC 

ArgHisArgAlaArgSerValArgAlaArgLeuLeuAlaArgGlyGlyArgAlaAlalle 
3761 AGACACCGGGCCCGGAGCGTCCGCGCTAGGCTTCTGGCCAGAGGAGGCAGGGCTGCCATA 
TCTGTGGCCCGGGCCTCGCAGGCGCGATCCGAAGACCGGTCTCCTCCGTCCCGACGGTAT 

CysGlyLysTyrLeuPheAsnTrpAlaValArgThrLysLeuLys 
8821 TGTGGCAAGTACCTCTTCAACTGGGCAGTAAGAACAAAGCTCAAAC 
ACACCGTTCATGGAGAAGTTGACCCGTCATTCTTGTTTCGAGTTTG 
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1-1319 



primer J159S 
:i ACTSCeerc AACTRT A ATK a 
PT G 

SarLeuLysTIirGl>TneLeuAlaAlaLcui , heTrrThrai8Ly8Ph0AsnAiAS«ri3i 
CTCCCrCAAAACTCGCTTTCTrCCCGCCCTGTTCTAC AC AC AC*\AGT7CAACGC GTC C 0 G 30 
TAG C C C GCT GAG t tcac t t a 

Aan Trp Cly His, Ser 

printer 166A for il-1216 



yCyfiProGluArgMetAlaSerCysArffSerileASPLysPheAspaiaGlyTrpGlyPr 

:l n; atgcccggagcocatogccagctgtcgctccattgacaacttcgaccagggatggggtcc 120 

C T T A GC A C AC C AC G T T C C 

Leu ProLau ThrAcp 



cIleTHrTyrAlaGinPrOAspAsnSerAspGlxiArffProryrCyeTrpHiisTyrAlaPr 
Jl 121 C ATCACCTATGCTC AAC CTG AC AACTCCQACCACAGGCC CT ATTGC7GGC ACTACGC AC C : 80 
?7 T GT CA CGGAAG GG C * CCCC : C 

Ser. AanSly SerCly Pra ?r? 



oAryClnCTsGlylleValFroAlaScrGlnValCycGlyProVslTyrCysPhcThrPr 
J i 181 7CGACAGTGTGGTATCGTACC CGCGTCGCAGG7GTGC GGTCC AGTOTATTGCTTC\CCC C 240 
PT AAA CT C 7 0 AA AGT T G X 7 

t-yafira LvgSer 



o3erProValValVaiClyThrThrABpArtPheClyAlaProTl\rT7rAsn7rpGlyAs 
241 AAGCCCTGTTGTACTGGGGACGACCCATCOn.CGGCaCCCCTACGTATAACTCCGCGCA 300 
C C G 3 A CAGCC GCCCG 7 

Ser 01 



pAfinGluThrAapValL5.;LeuLeuA3iiA3nThrArjppoPro«lsGiyA6iiTroPheC: 
J 1 :i01 CAATCAGACCGACC7SCTGCTCCTAAACAACACGCGGCCCCCGCACGGCAACTGG7TCGG 360 
?T A 7 CT CG 7 7 CA A. 70 7 

u As? PheVal Leu 



yCyaThr 
Jl 361 CTGTACA 307 
?T T CTGGATGAACTCAACTr«RA7T 

primer 199A 



Nucleotide Match: 259/367 (70. 5% J 

Amino Acid Maccn (atrtneent): 93/122 (75.2%) 

(ralaxad): 111/122 (91. OX) 

?rororTp e KCV (PT> sequences different from Japanese KCV (Ji; are shown. 
Stloxod aamo acid oatca: Gl*«Ala»Pro»3er»Thr ♦ Aap-Glu. Aan«GU. 
Arq *Bf-Lys=Kl5. Leylie-V&l-Met. Phe«Trp»Tyr. 
Under Una. different amino acid in relaxed matching 
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FIGURE 14-1 

core to NS1 vs. HCv^-1 



ProLeuVal 
TCCGCTCGTC 
A 



GlyAlaProlAuGlyGlyAlaAlaArgAlal^uAlaHisGlyValArgValLeuGluAsp 
1 GGCGCCCCCTTAGGGGGCGCTGCCAGGGCCCTGGCACATGGTGTCCGGGTTCTGGA^ 



M TC-T — A 



GlyvalAsnTyrMaThxGlyAsnl^iiProGly^ 
1 GGCGTCAACTATGCAACAGGGAATTTGCCCGGTT G CXCTTTCTCTATCTTCCTCTTGGCT 



CC-T — T TO 



LeuI^uSerCy aLeu ThrlleProAlaSerAlaTyrGluValArgAanValSerGlylle 
13 1 CTGCTGTCCTGTTTGACCATCCCAGCTTCCGCTTATGAM 

C — T — C TG-G — C G — C — CC TCCA-G OT 

Val Gin SerThr Leu 

rTTHisValThrAsnAspCysSerAsnSerSerlleValTyrGluAlaAlaAsDVallle 
- '7 1 TACCATGTCACAAACGACTGCTCCAACTCAAGCATTGTGTATGAGGCGGCGGACGTGATC 



?ro Ala 

HetHisAiaProGlyCysValProCysValArgGluAsnAsnSerSerArgCysTrpVal 
- ^ 1 ATGCATGCCCCCGGGTGCGTGCCCTGCGTTCGGGAGAACAATTCCTCCCGTTGCTGGGTA 
C CA-T G C— T T GG CG P- A 

Leu rhr Gly. Ala 

, , , ^a^ThrProThrLeuAlaAlaArgAsnAlaSerValProThrThrTte 
311 GCGCTCACTCCCACGCTCGOSGCCAGGAATGCCAGCGTCCCCACTACGACATTAGIAC^ 
A-G — C — T G-G— CA — G G — AAC G-G— CAGC-T T 

Met val Thr AspGiyLysLeu Ala Gin 

HlsValAspLeuLeuValGlyThrAlaAlaPheCysSerAlaMetTyrValGlvAspLeu 
; " I CACGTCGACTTGCTCGTTGGGACGGCTGCTTTCTGCTCCGCTATGTACGTGGGGGATCTC 

A TC T— C GC—CA-CC T~G~ CC-C Z — A 

Ser ThxLeu Leu 

CysGlySerVaiPheLeuIleSerGlnl^uPheThrPheSerProArgArgHlsGluThr 
431 TGCGGATCTGTTTTCCTCATCTCCCAGCTGTTCACCTTCTCGCCTCGCCGGCATGAGACA 
G C — T — TG—GG A T — CA-G — C — CTG G 

ValGly Trp 

valGlnAspCysAsnCysSerlleTyrProGlyHisValSerGlyHisArgMetAlaTrp 
491 GXACAGGACTGCAACTGCTCAATCTATCCCGGCCACGTATCAGGCCATCGCATGGCTTK 
ACG— A-GT T T TA — A-G — T — C A 

Thr Gly UeThr 

AspMetMetMetAsnTrpSerProThrAlaAlaLeuValVal^ 
551 GAIATCATGATGAACTGGTCGCCCACGGCAGCCTTAGTGGTGTCGCAGTTACTCCGGATC 

C — T A-G— G— G — AA— G-T—C-G 

Thr MetAla 

ProGlnMaValMet^pMetValAlaGlyAlaHisTrpGlyValLeuAlaGlyLeuAla 
6 11 CCACAAGCTCTCATGGACATGGTGGCGGGGGCCCACTGGGGAGTCCTAGCG 



■CA— T A-C — T — T — T G A-A— G 

IleLeu He lie 

TyrTyrSerMetValGlyAflnTrpMaLysValLeuIleVa^ 
671 TACTArrCCATGGTGGGGAACTGGGCTAAGGTTTTGATTGTGATGCTAC^^ 
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— T-TC G CC — G-A C G — A 

Phe V.ii Leu 

VaiAspGlyHisThrArgValThrGlyGlyValGlnGLyHisValThrSerThrLeuThr 
3 1 G TTGACGGGCATACCCGCGTGACGGGGGGGGTGCAAGGCCACGTCACCTCTACACTCACG 

— C — — C-G-A A C — C AAGTGCC ACTGTG GGAT-TGTT 

AlaGlu His SerAia ThrVal GlyPheVal 

SerLeuPhcArgProGlyAlaSetGlnLysIleGlraeuValAanThrAsnGlySerTrp 
91 TCCCTCTTTAGACCTGGGGCGTCCCAGAAAATTCAGCTTGTAAACACCAATGGCAGTTGG 

AG C-CGC A — C — CAAGC CG-C GA-C C 

LeuAla Lys AsnVal lie 

HisIleAsnArgThrAlaLeuAsnCysAsxiAspSerLeuGlQThrGlyPheLeuAlaAla 
851 CATATCAACAGGACTGCCCTGAACTGCAATGACTCCCTCCAAACTGGGTTCCTTGCCGCG 

— CC T — C — G TAG A-C— C — C-GGT-G — A-G- 

Ser Asn Trp Gly 



911 CTGTTCTACACACACAAGTTCAACGCGTCCGGATGCCCGGAGCGCATGGCCAGCTGTCGC 

— T TCACC T-T—A— C— T — T A-GC-A C— A 

His Ser Leu 




C — C— AC-G-T — T C C — T GT CAACGG AAGCGG - C -C 

ProLeuThrAsp Ser AsnGlySerGlyPro 

AspGlnArgProTyrCysTrpHisTyrAlaProArgGlnCysGlylleValProAlaSer 
1031 GACCAGAGGCCGTATTGCTGGCACTACGCACCTCGACAGTGTGGTATCGTACCCGCGTCG 

C-C— C— C C-C—AAA—CT— C T — G AA- 

Pro Lys Pro Lys 

GlnValCysGlyProValTyrCysPheThrProSerProValValValGlyThrThrAsp 
1091 CAGGTGTGCGGTCCAGTGTATTGCTTCACCCO^AGCCCTGTTG 

Ser 

ArgPheGlyAlaProThrTyrAsnTrpGlyAspAsnGluThrAspValLeuLeuLeuAsn 
115 1 CGTTTCGGCGCCCCTACGTATAACTGGGGGGACAATGAGACGGACGTGCTGCTCCTAAAC 

A-G-CG G — C — C — C-G T — A T CT-CG T 

Ser Ser Glu Asp PheVal 

AsnThrArgProProHlsGlyAsnTrpPheGlyCysThr 
1211 AACACGCGGCCCCCGCACGGCAACTGGTTCGGCTGTACA 

— T — CA A TG T T 

Leu 
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.lyAsnTrpPheGlyCysThrTrpMat^ 
TGGGCAACTGGTTCGGCTGTACATGGATG AATAGCACTG* SC-TTGACCAAGACGTGCGG AG 

^ , T— C CTCA Ar *val 

ProProCysAsnlleGlyGlyValGlyAsnAsnThrLeuThrCysProThrAspCysPhe 
6 1 GCCCCCCGTGTAACATCGGGGGGGTCGGCAACAACACCTTGACCTGCCCCACGGACTGCT 

CG--T— T GT A CG C — CA T— T 

Val Ala His 

ArgLysThrProThrAiaThrTyrThrLysCysGlySerGlyProTrpLeuThrProArg 
i ->1 T CCGGMGACCCCGACGGCCACTTACAC^AAATGTGGTTCGGGCCCTTGGTTGACACCTA 

C CAT GAC A T-TCGG— C — C — C — T~C A-C C- 

His Asp SerArg Ile 

^ysLeuValAspTyrProTyrArgLeuTrpHisTyrProCysThrValAsnPheThrlle 
' S 1 GGTGCTTGGTTGACTACCG^TACAGGCTCTGGCACTACCCCTGCACTGTCAACTTTACCA 

r c G — T T T — I — T — T — CA AC 

lie Tyr 

PheLy sValArgMetTyrValGlyGly ValGluHi s 
^41 TCTTCAAGGTTAGGATGTATGTGGGGGGCGTGGAGCAC 

-A — T — AA-C C A — G — C — A 

He 
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C200 region Mqueac* va. HCV-1 



— eo AMHttSertysAlaHisGlyThrAapProAanllttArgThr 

ThrX*uGlyPh«GlyAlaTyr * il* 




" ~ C— G G- T A TTTGTGACGAGTGCCACTCC 

. rieCysAspGluCyaHlsSer 

* 

ThrAapAlaThrSerllel^lylleGlyThrValLeuAapGlnAlaGlu^ 
HCV-1 4021 GCGMACTGCTreKCTCGa^ 

AlaArgLeuValVaiLejAlaThrAlaThrProProGlyS«rV«lThrValProHisPro 



C200 Serlle 
HCV-1 4081 ^CA^GGWWrTOCTOTCCACCACa^ 

AsnlleCluCluValMalAuSernjThtGlyCltillBProPhBTTrGlytysiaa 



"- v ~ 1 — g— A-TA G — A —A" 



Vftl 



-J)0 4201 ^GAGCTCGCCGCAAASCTCHX^ 

nCV-1 A -CTC—AT-G — CA C— C 

Val Il« 

FIGURE 16-1 
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l«u^ValSerVallleProThrS«GlyXapValValVaiValAlaThrAap 
C200^ 4261 CTTGATGTGTCCGTCATACCAACTAGCGGAGACGTOT^ 4316 
KCV- 1 C- C— G— C— C — T — T — C C — T — CCTC 



KCV-1 4321 ATGACCGGCTATACCGGCGACTTCGACTCGGTGAIXGACTGCAATAC^ 

XetThxGlyTyrthzClyAspPhftAapSerV^ 



HCV-1 4381 ACAGTCGATTTCAGCCTTGACCCTACCTtCACCATPGAGACAATCACGCTCCCCCAG 

ThrVaOA«pPh*S«rLeuAjpProThrPheThr IleGluThxI ieThrLeuProGlnAsp 



t -1 4441 GCTGTCTCCCGCACTCAACGTCGGGGOUKMTGGCWKWa 

AlaValSexArgThit;inArgArgGlyArgThrClyArgGlyLysProGlyllar7rArg 



KCV-1 4501 TTTGTGGCACCGGGGGAGaCCCCTCCGGCATCTTCGACTa 

PheValAlaPrcsGlyGluArgProS«xGlyMatPheAapS«xS«r/alLeuCyaGluCys 



HCV-1 4561 TATGACGCAGGCTGTGCTTGGTATGAKTCACtXCCGCCSAGM 

TyrAapAlaGlyCyaAlaTrpTyrGluteuThrPrc^ 



KCV-1 4621 GCGTACATGAACACCCCGGGGCTTCCCGTGTGCOU^CCA^ 

AlaTyrMetAanThrProGlyLeuProValC^sGln^ 



_V-1 4681 GTCCTTACAGGOCTCACrCATATAGATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGG 

ValPhaThrGlyLauThrHiallaAapAlaHisPhaLeuSarGlnXhrLytGlnSarGly 



HCV-1 4741 GAGAACCTTCCnACCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCT 

GluAanI*uProTyrIieuValAlaTyrGlnM^ 



HCV-1 4801 CCCCCATCGTGCGACCAGATGTGGAAGTGTTTGATTCGCCTCAAGCCCAC 

ProProSerTrpAspGlnMetTrpLyaC^aJ^uIleJ^ 



HCV-1 4861 CCAACACCCCTGCTATACAGACTCGCCGCTGTT Ci U>AATGAA 

ProThrProl^ulAuTyrArgLeuGlyAlaVaM 

FIGURE 16-2 
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HOT-1 4921 GTCACCAAAIACAICMGACAT^ 

ValThrLyalYrlleMBfeThrCysMetS^ 



KC7-1 4981 GTGCTCGTTGGCGGCGTCCTGSCTGCTTTGGCCCCGTATT^ 

ValI*uValGlyGlyVaiI*uAlaMaLeuAla^ 



31uVal 
: :094 3AASTC 

5041 GTCAXAGTSGGCAGGGTCGTCTTG TCCGGGAAGCCGGCAATCATACCTGACAGG 

VallleValGlyArgValValLeuSerGlyLyaProAlallelleProAapArg 



u200^ 3101 v.TCTACCGAGAGTTCGATGAGATGGAAGAGTGCGCCTCACACCTCCCCTACATCGAACVG 
nCV-i T~TCAG T-A — G S~A 




LysGlxiAlaGluAlaMaAlaProCysGluSezMetHlsAlaSer 
C200 . 5221 AAGCAAGCGGAGGCTGCTGCTCCGTGTGAGTCAATGCACGCCTCGA 
KCV-i CGX— G— A T- ATC--C— TGC— PCCAG-CCA-"-TGGCAA-AACTCGAGACCTTC 

Arg Vallle AlaValGlnThrAsnTrpGlnLysI^uGluThrPhe 



HCV-1 5281 TGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATA 

TrpAlaLyaHIsltatTrpAsnPhelleSarGlylleGln 
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MSI sequence vs. HCV-1 

L^lyAsnTrpPheGlyCysThrTi^^ 
r. 1 n 1 GTTGGGCAATTGGTTCGGTTGCACCTGGA^ 

r. -1 -C — T— a C 

Thr 

c , , "CysVallleGlyGly ValGlyAsnAsnThrLeuGlnCysProThrAapCy s 

- x 61 AGCGCCTCCTTGTGTCATCGGAGGGG1TKXJCAACAAC 

KCV- 1 -C C C T — 

Ala His 

, 121 XTTCCGCAAGCATCCGGACGCCACATACKrKX^ 



HC/-1 



— C— A— 



, a , ^Aj^s^ValHisTyrProTyr^ 
™ , 181 CAGGTGCCTGGTCCACTACCCTTATAGGCTTTGGCAT^ 

rICv- 1 - ■ ■ ■ — — 

Asp He 

. , , ^^P^ysValArgMetTyrValGlyGlyValGluHisArgLeuGluValAlaCy 

He He * Ai"*" 0 

TrpThrArgGlyGluArgCysAspLeuAapAspArgAsp 
1 301 CTGGACGCGGGGCGAGCGTTGTGATCTGGACGACAGGGACA 

HCV-1 A C A 

Glu 
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Core sequence vs. HCV-1 



T1 V _ 1 1 GCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCC 
£1 - 7 G GGAGAGCCATAGT GGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGACCGGG 

*^ — "M M M M mm » M1M M — _MM— mmm^mmmm 

177 T AGCCGA GT A6TGTTGGG TC GCGA AAGGCCTO 

„_ _ MetSerThrAsnProLysProGlnArqLys 

-1 , -37 GTGCCCCGGGAGGTCTCGTAGACCGTGCATCATGAGCACAAAXCCTAAACCTCAAAGAAA 

.-iCV-1 — c q A 

Lys 

V , " 97 -^CCAAACGTAACACCAACCGCCGCCCACAGGACGTCAAGTTCCCGGGCGGTGGTCAGAT 

~A T T _c 

Asn 



u^r i J3 CGTTGGTGGAGTTTACCTGTTGCCGCGCAGGGGCCCCAGGTTGGGTGTGCGCGC^ 
HCV-L T T— A- 



5v i GAAGACTTCCGAGCGGTCGCAACCTCGTGGAAGGCGACAACCTATCCCCA 

HCv-1 A A — T — A — T — G T-G 

Arg 

- n . -m ^H^™i y ^ g ^ aT ^^ aGlnPr ^ 1 y^ rProT ^ProLeuTy 

:JL_, ' GCCCGAGGGCAGGGCCTGGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCAACGAGGG 

Thr T 

MetGlyTrpAlaGlyTrpLeu 
T l 537 CATGGGGTGGGCAGGATGGCTCCT 

CV-l -TGC G 

Cys 
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